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FOREWORD 


THE publication of this book reflects a belief, which I share 
with Dr McIntosh, that the scaling of teachers’ marks will 
have an established and permanent place not merely in 
educational research but in many departments of everyday 
educational practice. We used the technique on a large scale 
in our Dundee investigation on Selection for Secondary 
Education, and our experience left us both with a conviction 
of its practical possibilities—a conviction that has been 
greatly strengthened by the results of later researches by Dr 
McIntosh, Dr Walker and Mr Mackay. 

Estimates of pupils' attainments in school subjects are 
needed for a multitude of practical educational purposes; and 
it is part of my educational faith that the assessments of the 
teachers provide the best guide that is, or ever will be, avail- 
able as to the order of merit of their pupils in these attain- 
ments. But, unfortunately, the standard and scatter of marks 
vary from teacher to teacher and from school to school; and 
while teachers' assessments may be improved in this respect, 
it is extremely doubtful whether they could ever be steadied to 
such an extent that théy could be-üsed with assurance as a 
basis for the serious decisions involved in selection for 
secondary education or the award of Leaving Certificates. If 
these premises are sound, there is no escape from the con- 
clusion that teachers’ marks should be used for such purposes, 
but only if they have been scaled in such a way that the 
estimates of different teachers are comparable. 

Scaling has already been employed in practice to a consider- 
able extent, particularly in selection for secondary education, 
but one suspects that it would have been more widely used but 
for two relatively groundless fears. The first is that this is a 
tool that can be handled only by the expert mathematical 
statistician: the second is that a prohibitive amount of 
laborious calculation is involved. The present volume should 
do much to dispel these fears. It gives a simple explanation 
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of the basis of scaling, and clear and definite instructions as to 
the best ways of making the calculations. Now that such a 
work is available there should be little difficulty in the use of 
the technique by the staff of an Education Office, the head- 
master or the class teacher. 

The book is, however, much more than a statistical primer 
giving instructions for the use of methods that are already well 
known: it is an important contribution to educational re- 
search. In the hands of Dr McIntosh and his colleagues, new 
and improved scaling procedures have been evolved—pro- 
cedures that have passed the acid test of extensive use in 
actual practice. New fields for the application of scaling 
. methods are opened up; and headmasters and class teachers 
who study the authors' suggestions will find their reward in 
increased interest in their work and fuller satisfaction that 
they are carrying out their important task of adjusting the 
education they provide to the varying talents and needs of 
their pupils. 

WILLIAM MCCLELLAND 


PREFACE 


THE ideas developed in this book have their origin in 
McClelland’s “Selection for Secondary Education." Іп that 
book a clear case was presented against the use of teachers’ 
estimates for the purpose of selection unless they were made 
comparable from school to school. 

These ideas were further developed in the Report of the 
Scottish Advisory Council on Secondary Education. Here 
the scaling procedure was suggested. for use in an examination 
scheme for the award of the Leaving Certificate and, again, on 
the discussion of marks and terminal reports, it was suggested 
that school examination marks should be made comparable 
by some method not aiming at * statistical impeccability, but 
rather something reasonably valid yet easily workable by the 
busy teacher in the school." 

We have attempted to attain the Advisory Council's 
objective. Мо deep knowledge of statistics is required for 
the understanding of our methods. Chapter III on “А 
Little Statistics " will, we hope, give all that is necessary for 
the understanding of the techniques we have devised: it is an 
attempt at statistics without tears. 

We should like to emphasise also that our methods are not 
merely theoretical but have been adopted in practice both in 
education offices and in a large junior secondary school. The 
needs of the administrator and “ the busy teacher in school ” 
have been in the forefront of our minds all the time. 

We wish to acknowledge our indebtedness to Mr 
McClelland for writing the foreword, for the keen interest he 
showed in our work and for the many helpful suggestions 
which he made. Mr W. F. Lindsay, Assistant Director of 
Education for Fife, read the manuscript in its various stages 
and offered many constructive criticisms. We also received 
much help regarding the layout of the book from Mr S. 
Stewart, Educational Editor of Messrs Oliver & Boyd. For 
illustrative materials in Chapters VII and VIII we are 
indebted to Midlothian Education Committee. 
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СНАРТЕК 1 
EXAMINATIONS 


EXAMINATIONS have been for long one of the most соп- 

»troversial topics in education. There are those who hold 
Strongly that examination results are the only true 
measure of the success or failure of education and they do not 
hesitate to cut out “ frills ” such as physical training and 
music as soon as examination day is in sight. On the other 
hand there are those who claim that most of the faults of 
present day education can be traced to the examination 
System; education is a thing of the spirit and to examine 
subjects such as literature is “ as impossible as to imprison 
sunbeams." Мо doubt the truth lies somewhere between 
these two extreme points of view; examinations are certainly 
not the * Alpha and Omega of education," yet on the other 
hand * the life without examination is a life that can hardly 
be lived." 

There are few who would argue that class tests or examina- 
tion are harmful if they are not too frequent. A teacher 
must have some check whether a pupil has been profiting by 
instruction; for example, if a teacher wishes to know whether 
his pupils can add or subtract, he must give them several 
addition and subtraction sums to work, and from the results 
he can tell which pupils have a thorough grasp of these 
two operations. Each examination or test should be set with 
a definite aim in view and should be constructed to achieve 
this aim. For example, a test may be designed to measure а 
child's level of attainment or may be set with the intention of 
finding out where his weakness in a particular subject lies. 
Again, an examination to select a few clever pupils will be 
quite different from an examination, the aim of which is to 
Pick out a few less able pupils. 

Most of the criticism against examinations refers to the 
external examination. Not only has it “ cabin’d, cribb’d, 
confined ” the teacher but its results have often been beyond 
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the teacher's comprehension: pupils who should have passed 
with ease fail and some who should have failed pass. 

A great volume of research has been carried out on 
examinations. It has been shown that two examiners will 
give widely different marks to the same paper and that the 
same examiner will give quite a surprising variation of marks 
when he assesses the same pupils at different intervals. 

The following quotations illustrate the type of result 
obtained in these researches :— 

“Тһе facts of a subjective scale are well illustrated іп the 
following anecdote concerning the grading of history papers 
by a group of college professors of history in the summer of 
1920. One of the five or six expert readers assigned to a 
certain group of history papers, after scoring a few, wrote out 
for his own convenience what he considered a model paper for 
the given set of questions. By some mischance this model 
fell into the hands of another reader who graded it in a per- 
fectly bona fide fashion. The mark he assigned to it was 
below passing, and, in accordance with the custom, this 
model was read by a number of other expert readers in order 
to ensure that it was properly marked. Тһе marks assigned 
to it by these readers varied from 40 to 90." * 

French investigators dealing with essays selected three 
scripts, Nos. 23, 25 and 34, “ each of which at the original 
* baccalaureat ’ examination had been awarded 36 marks out ° 
of 80 (or 45 per cent.) and had been ranked as 24th out of a 
batch of 50. These three scripts were marked independently 
by 76 examiners. The marks for script No. 23 varied from 
4 to 52, for script No. 25 from 12 to 64, and for script No. 34 
from 16 to 65 out of a maximum of 80. The mean marks for 
the three scripts were as follows:— 

Script No. 23—25-9; Script No. 25—40:0; Script Мо. 
34-34-44, 

In an English experiment 14 examiners were asked to 
re-mark 15 history scripts some 12 to 19 months later, having 
kept no record of their previous marks. The examiners 


* Examinations and their Substitutes in the United States: The Carnegie 
Foundation for the Advancement of Teaching, Bulletin No. 28, 1936, page 64. 
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awarded not only numerical marks but the verdict of Failure, 
Pass or Credit. “It was found that in 92 cases out of the 
210 the individual examiners gave a different verdict on the 
second occasion from the verdict awarded on the first." * 
These results refer, of course, to the essay type of examina- 
tion. То avoid such variations the objective type of question 
? was devised where there is only one correct answer and this 
receives a definite mark. Examples of this type of question 
are;— 


1. Underline the word that means the opposite of the 
first word. 
Cautious—guarded, adverse, harsh, rash 
Sympathy—antipathy, passion, disrespect, courtesy 


2. Change the following sentence to the past tense:— 
The tree bears much fruit (Present) 


еее much fruit (Past) 
3. Complete the following:— ^ 
20) 
13 7117 


Objective tests have been devised chiefly for primary school 
subjects. There is a growing volume of experiment with 
tests of this type for the secondary school curriculum. In 
Appendix I we give sample questions from tests devised for 
pupils at the Leaving Certificate stage. 

The great advantage of tests devised in this way is that any 
two teachers correcting them will be sure to award the same 
mark to each script: there can be no variation in the standard 
of marking. Considerable ingenuity has been shown in the 
construction of such tests and they can be made to test not 
only attainment but a child's ability to reason. 

Тһе disadvantage of these tests is that they cannot easily be 
made to measure a pupil's ability to marshall ideas and 
set them down in an orderly sequence. This ability may 
be a very important part of the subject: in English, for 
example, composition must always play an important part 

* HARTOG & RHoprs. An Examination of Examinations, International 
Institute Examinations Enquiry, page 81 and page 15. (Macmillan, 1935). 
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even though its assessment does present considerable 
difficulties. 

The problems connected with setting and evaluating 
examinations are not primarily our task. It is necessary, 
however, to stress the desirability of improving examination 
techniques for the very good reason that it is not worth while 
to go through the scaling process with marks that are not? 
themselves good estimates of the pupils’ achievements. 
Scaling does nothing to add to the reliability of marks; its 
function is to give them meaning and to render them com- 
parable. 


| 
| 
| 
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CHAPTER II 
THE NEED FOR SCALING 


IN the Advisory Council's Report on Secondary Education* 
this challenging sentence оссигз:—“ It is important that 
the records of any school should be in a form that means 
something to another school or to an outside body 
and that the report to the parent should enlighten and not 
mislead.” Тһе implication of this statement is that at present 
marks of one school do not mean much to another school, 
and that parents get their reports in a form which leaves them 
very much ‘in the dark.’ Coming nearer home it might be 
asked whether, in a given school, class marks mean some- 
thing to all members of the staff and whether that something 
is the same thing. 

An examination mark by itself has practically no meaning; 
the traditional belief that it has an absolute value, so many per 
cent. of a possible or “ perfect ” performance, is without 
foundation. All that a mark of 60 per cent. conveys is that 
it is one of a set of marks which has a range within the limits 
zero to one hundred. Whether the mark is to be reckoned as 
“very good,” “good” or “not so good” (“ high,” 
* medium ” or * low ”) depends on how 60 is related to the 
other marks made by the class. 

When the teacher has finished marking a set of examination 
papers he has a clear idea of the value of the mark 60. If he 
puts the papers in order of merit and counts down to find the 
mark of the middle pupil he is only making explicit what was 
implicit in his mind before. If he goes further and finds the 
mark of the pupil who is one quarter of the way from the top 
and bottom respectively, he will get a still clearer picture of the 
value of 60. In the teacher’s marks book the marks теат, 
at least Гог a time, their full meaning. Later оп they may be 
transferred to the class teacher's marks register, to a head- 


* Secondary Education. А Report of the Advisory Council on Education in 
Scotland, Chap. XIII, para. 700, page 147. (H.M. Stationery Office, 1947). 
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teacher's record card and to a parent's progress card. АП 
that is usually entered on the pupil's permanent record is the 
entry * English 60.7 The collateral information that gave 
60 its meaning is thrown away and the headteacher is left to 
wonder if a pupil who has scored 60 in English and 60 in 
history has done equally well in the two subjects. He has to 
go on the assumption that the two marks are equal, although. 
only by chance will this be true. It is a serious matter that 
marks in the files of the headteacher, on which the final 
assessment of the pupil for educational and vocational 
guidance or other external purposes is based, should be so 
lacking in exact specification. 

In some secondary schools, in addition to the pupil's mark, 
the class average or the pupil's place in the order of merit is 
given against each subject in the report to parents. Even so, 
it will not be an easy task for the parent to decide how much 
better or worse 60 in English is than 60 in history. Неге is a 
term report for a pupil from a secondary school:— 


Р Class 
Subject Mark | Ауегаде 
English 61 58 
History 70 52 
Geography 70 52 
Arithmetic 46 48 
Algebra 55 56 
Geometry 71.75 63 
Science 62 63 
French 50 40 
Latin 72 68 
Art 58 57 


What is the parent to make of the fact that in this report 
the class average mark in French is 40 and in Latin 682 Тһе 
teacher of Latin knows that he has set a comparatively easy 
paper and the French teacher recognises that his paper has 
been too stiff; the headteacher may come to the conclusion 
(quite unwarrantedly) that the class is better at Latin than at 
French or that it is a way these two teachers have of marking 
which he must keep in mind when forming a judgment. The 
parent, however, taking the figures at their face value, will 
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come to the conclusion that the French teacher is a poor 
teacher. 

Furthermore, the parent would be misled if he came to the 
conclusion that his boy was doing equally well in history апа. 
geography because, the class average of the two subjects being 
52, his boy has 70 marks in each. Не would be right only in 
the odd case when the scatter of marks in the two sets was the 
same. То add à further column showing that he was ninth in 
history and fourth in geography, instead of conveying 
further information, would probably puzzle him. Even the 
best progress cards at present although they go some way to 
eliminate the misleading qualities of raw marks, do not fully 
enlighten. 

Marks in the same subject in different terms do not have the 
same meaning. For example, 60 in the first term may 
indicate a poorer performance than 60 in the third term. 
Here is an extract from a pupil's record card :— 


Ist year | 2nd year 
Term т. ШЕ | mags ps 
60 Nm 74 ЈЕ: | 2 Е 


Mark 


Class Average 59 


It would appear that the third term 60 is.a better per- 
formance than the first term 60. 

How does the second term 74 compare with the second 
year first term 79? This question can only be answered if the 
remainder of the class marks is known or at least if the scatter 
or spread of the marks is known. 

The same problems face the Director of Education who 
wishes to use Teachers’ Estimates іп a scheme of selection for 
secondary courses of instruction: it will be a very lucky 
chance if a mark of 70 in arithmetic from school X 
means the same as 70 from school Y. 

The points made in the preceding discussion may be 
illustrated by the following examples. Suppose the English 
papers of a class of 36 showed the following distributions in 
their autumn term test (A), each X representing one pupil. 
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Fic. I 
Distribution of Marks in Test A 
x 
x х х 
х x X х х 
ES РОБОМ 98 90 RS bt 
Хх хә NE ENS Ха xe 
Ххх хана dE T ХО хх 
252 30: 135” 40; 45: 50. 55 160: 65: 10" 75 
MARKS 


A pupil with 50 will be right in the middle of his class while a 
pupil with 70 will have only one of his classmates scoring a 
higher mark. . 

Suppose that the spring test (B), considerably easier or 
marked with less severity, gives the following distribution of 
marks:— 

Fic. II 
Distribution of Marks in Test B 


X 
КОХ wx 
25 947% ха Ж 
nof 4 X Ur EP Хаи У 
ро, ХЕХ M Qd toe c 
x x х х x x x x X MOT XS 
457 507 535 60 65 70 75 807 85 90 95 
MARKS 


Here a pupil with a mark of 70 will be in the middle of his 
class. Тһе two 70s have entirely different meanings. 

As the scatter of marks is the same in both tests the marks 
in the spring test may be made directly comparable with those 
in the autumn test by subtracting 20 from each spring test 
mark. On the other hand, if we want to combine the marks 
we do not need to adjust them to the same average. Takean 
extreme case: if the boy who scores 75 in the autumn (А) 
test falls to the average іп the spring (В) test he gets a total of 
145; if the “average” boy in the A test jumps into top 
place in the B test he also gets 145. Тһе higher average 
gives no greater “ weight " to the spring test—a point that is 
not generally understood. 

Suppose that in a third test (C) the marks are distributed 
as follows:— 
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Ко. Ш 
Distribution of Marks т Test С 


x 
x 
о NT Ж 
SCORE a Жу ХАН 
Хе э Жы ЖЕ 
ы WC X хх XIS 
ES EU SSK KS 
Жез K OC ехе Хе Ж SS 
357 40071457 50) (55! O 65 


MARKS 


Тһе average score of 50 in C is the same as in A, but the 
scatter of marks is much less—the ratio is roughly 30:50. 
When it comes to combining the three tests, one effect of this 
shrinkage of scatter in the last term is to penalise the boy who 
came out top in C by comparison with his classmates who 
who were first in A or В. Being highly placed in the test 
scored by the marker of restricted range pays a poor dividend 
when the annual class list comes out. To illustrate this 
effect let us suppose a boy was first in А and dropped to the 
average place in C. Compare him with his opposite 
number who jumped, we shall suppose, from average place in 
A to top in C. The former is credited with a total of 125 
and the latter with 115. This is obviously unfair; to make 
things fair the C marks should have been stretched out from 
the centre both ways so as to increase the range of C to 50. 

The general principle is that the greater the scatter of marks 
the greater the weight of the mark. Differences of scatter, 
generally overlooked, are just as important as differences of 
average in interpreting marks. Discrepancies in both should 
be ironed out if marks are to be compared or combined. 
When the day comes that a headteacher or Director of 
Education finds on his desk only marks that have been ironed 
out or adjusted or scaled to have the same average and 
scatter, he will then be able to make well-informed judgments 
on the standing of the pupils. 

The extract from a pupil's record card on page 7 shows how 
widely apart class averages may be in the ordinary run of 
School marks. Would we find similar differences in scatter? 
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Тһе marks shown in diagrammatic form in Fig. IV came from 
a Group B class in a secondary school. The class average 
marks were controlled to some extent by exhortation; the 
teachers were asked to aim at an average round about 56. 
No attempt was made to adjust any realised discrepancies. 
The broken lines show the vagaries of the class average. 


It jumps up and down as we pass from subject to subject. A ° 


measure of scatter may be taken as the difference between the 
pupil a quarter of the way from the top of the class and the 
pupil a quarter of the way from the bottom of the class, and 
this is shown by the length of the verticallines. This length 
varies from subject to subject and shows how unequal the 
scatter of marks has turned out. Clearly, it would pay a boy 
with his eye on his yearly average mark to score well in 
mathematics and arithmetic, while the best endeavours of the 
practical boy depending on his benchwork mark would avail 
him little. 

It can be computed in this series of examinations that 70 in 
English is really just as high as 83 in arithmetic. or 75 in 
science and no higher than 68 in history, 61 in geography and 
62 in art. 

Unquestionably, raw marks are in no fit state to be either 
compared, compounded or interpreted. It is the business of 
the scaling process to make the necessary transmutations. 
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Ев. IV 
Illustration of Differences т Mean and Scatter of School Marks 


т ze z 
: о a PE = 2 a 
> о x > I^ осла 
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90 
Е 
НЕ: 
80 
Е i 
H 
Lr г a 
70 i 
60H - - 
50 
H 22255: 
| 40 i 
| Ч 
| Я 
| | ЕЕ 
30H 
Hu 
mum 
| H H Е 
| 20 1 
| --- EXTREME MARKS 
| Н БЕЕН 


СНАРТЕК ІП 
A LITTLE STATISTICS 


AN аројогу—“ To those unfamiliar with the recent course 
of educational psychology the intrusion of mathematical 
refinements into problems of the classroom may seem 
to be little short of perverse pedantry, more ridiculous and less 
excusable than the crotchets of the trigonometrical tailot who 
fitted Gulliver with a suit of clothes by means of a sextant and 
atheodolite. Examining children, it may be urged, is among 
the simplest of the teacher's daily duties; and needs little else 
but common sense and rule of thumb: by many words, still 
more by many figures, counsel is darkened . . . . 


* While Swift was gibing at the mathematicians of Laputa, 
Newton was writing his Principia. The satirist may be read 
the more widely: but the mathematician has more pro- 
foundly changed and furthered the course of civilisation." * 

This chapter is introduced to give the non-mathematical 
readers some idea of elementary statistical terms. Lest they 
take fright we would remind them of Shepherd Dawson's 
claim that “ the calculation of statistics is a matter of mere 
arithmetic and the understanding of their general significance 
requires little more than common sense." 


DISTRIBUTION OF MARKS 


After he has marked a set of examination papers a teacher 
generally arranges them in order, the highest mark on top and 
the lowest at the bottom. With the papers thus arranged, he 
can readily give each pupil an ‘ order of merit’ or a ‘ rank." 
Неге аге the marks іп an English examination for a class of 
20 pupils :— 

84, 78, 74, 72, 66, 66, 65, 59, 56, 56, 54, 54, 49, 
49, 47, 43, 41, 41, 40, 32. 

* Sır CYRIL Burt. Mental and Scholastic Tests, page 142. (Staples Press 

Ltd., 1947). 
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The pupil who scores 72 is fourth in order of merit or has 

the rank 4, the pupil with 59 is eighth or has the rank 8, and so 
on. Some difficulty is experienced with the mark 66. The 
teacher generally states that the pupils with this score are 
fifth equal; the statistician would give each the rank 5-5. 
If the sixth, seventh and eighth pupils from the top have the 
same mark, each pupil is given the rank 7. 
' The results of an external examination dealing with some 
hundreds of pupils can hardly be treated in this way. Іп such 
cases the marks are generally arranged in groups or “ class 
intervals." А simple case is that illustrated by Fig. I, page 8. 
For ease, we illustrated the case where one pupil scored 25, 
two pupils 30, three pupils 35, and so on. This is a most 
unlikely result in any examination. If, however, the one 
pupil had made a score between 21 and 25, two a score 
between 26 and 30 and so on, we have a much more likely 
result. The group of marks 21-25 is known as the class 
interval. Тһе number of cases in a class interval is known as 
the frequency. ` 

The frequency distributions for tests A, B and C are shown 
below. 

TABLE 1 


Frequency Distribution of Marks for 
Examinations А, B and С 


Frequency 
Class Interval X B с 
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One of the advantages of this method of recording marks is 
that it is possible to obtain from the distribution a rough idea 
of how the test has spaced out the candidates. For this 
reason the distribution is frequently called the score scatter. 

This may be further illustrated by the use of graphical 
representations of score scatters. For example, the dis- 
tribution of marks shown in column A in Table 1 is shown 
as follows:— > 


Fic. У 


Diagrammatic Representation of Frequency Distribution А 


21-25 26-50 31-55 36-40 41-45 46-50 51-55 56-60 61-65 66:70 71-75 
MARKS 


The above diagram is known as a histogram. 

It is interesting to think out the types of distribution that 
are most suitable for examinations of different types. For 
example, an examination designed to select the most able 
pupils should have a score scatter of the first type shown 
opposite. К 

It is then much easier to decide which аге the best candi- 
dates; the bunching of the pupils at the foot of the scale does 
not matter. 
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Frequency 


0 0 0 30 40 % 60 70 80 90 100 
MARKS 


Conversely an examination for a certificate to be awarded 
to three-quarters of the candidates should have a score 


scatter of this type:— 
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Here the important part of the mark scale is that dividing 
the lower quarter of the candidates from the upper three 
quarters. 

The devising of examinations to produce these score 
scatters is a separate subject, on which we shall not enter. 
We strongly recommend that for each test or examination the 
teacher sets he should find the distribution of marks. 


MEASURE OF STANDARD—THE AVERAGE OR MEAN 


The standard of marking is simply expressed by the mean or 
average mark. For example, the standard of marking in 
examinations А and C is the same because the mean or 
average in each case was 50. Examination B was marked on 
a much easier standard as the class average or mean mark 
was 70. 

Where the number of marks is small the mean or average is 
easily calculated by adding them and dividing by the number 
of marks. Where the number of marks is large it is usually 
better to prepare a score scatter as shown in the previous 
section and to employ the method of calculation shown in 
Appendix II. 


MEASURES OF SCATTER OR SPREAD OF MARKS 
(a) Range 


One simple method of indicating the scatter of marks in a 
test is to give the difference between the largest and smallest 
marks. This is known as the range. Іп the three examples 
above, the ranges are 50, 50 and 35 showing that examination 
C gave the smallest spread of marks. Тһе range as a measure 
of scatter is of little use when the marks are not evenly dis- 
tributed. For example, the range of 50 obtained in test А 
would be drastically altered if the mark of a single pupil were 
altered from 75 to 90. 1 


(b) Interquartile Range 


The upper quartile, generally denoted by О., is the mark 
below which three quarters of the candidates fall and the lower 
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quartile, denoted by Q,, is the mark below which one quarter 
of the candidates fall. Тһе difference between these marks 
is the interquartile range. 

For example, in Table 1, the upper quartile for examination 
А is the mark below which 27 of the candidates fall. This 
mark must lie somewhere between 56 and 60 because 30 


"candidates have scores less than 61 and 26 candidates have 


scores less than 56. It can be shown to be 57. Similarly the 
lower quartile is in the range 36-40 and can be shown to be 39. 
The interquartile range is therefore 57 - 39 —18. 


(c) Interpercentile Ranges 


The idea of the quartile may be extended to cover not only 
quarters but also hundredths. The mark below which lies 
80 per cent. of all the marks is known as the 80th percentile. 
In general, a percentile is the mark below which lies a certain 
percentage of all the marks. 

It will be obvious that the 25th percentile is the lower 
quartile, and the 75th percentile is the upper quartile. The 
50th percentile, below which lies half of the candidates, is 
called the median. In this book we shall often use the inter- 
percentile range from the 16th to the 84th percentile as а 
suitable measure of spread. 

We denote the xth percentile by Р,; for example, the 80th 
percentile is denoted by Pso. 

We regard the percentile of such importance that we feel 
justified in showing how it is calculated. We take the fre- 
quency distribution А from Table 1. 

We shall demonstrate each step by calculating Рао, the 80th 
percentile, from Table 2. 

The first step is to find the cumulative frequencies corres- 
ponding to each class interval. These are shown in column 
F, Table 2. Тһе cumulative frequency (F) column is рге- 
pared by adding the successive class frequencies from the 
bottom to the top. The cumulative frequency corresponding 
to the interval 36-40 is found by adding the frequencies 
1+2+3+4=10 and gives the number of scores in the 
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distribution lying below the score 40-5. (Note.— The class 
interval 36-40 is regarded as extending from the lower limit 


35-5 to the upper limit 40-5). . 
Similarly for class interval 41-45 the cumulative frequency is 


1+2+3+4+5=15. 


То Calculate a Given Percentile from Grouped Data 


TABLE 2 
Calculation of Percentiles in a Frequency Distribution 
| Class Cumulative Percentage 
Interval Frequency Frequency Cum. Freq. 

(c) (f) (F) (% Е) 
71-75 1 36 100 
66-70 2 35 97.2 
61-65 3 33 91.7 
56-60 4 30 83.3 
51-55 5 26 72.2 
46-50 6 21 58.3 
41-45 5 15 41.7 
36-40 4 10 27.8 
31-35 3 6 16.7 
26-0 2 3 8.3 
21-25 1 1 2.8 

| N-36 | 
Any percentile may be calculated from the formula:— 
xN-F 
Px =] Jem xc 


where xN — Percentage of N the total frequency 
1 - lower limit-of the class interval in which Px lies 
Е = cumulative frequency of class interval immediately 
below / 
f - number of scores within the class interval in which 
Px falls 
c=the size of the class interval. 


To calculate Pso. 80 
ХМ =—~ x 36 =28:8 


100 
7. Рао falls within class interval 56-60 
1=55:5 f-4 
F=26 с=> 
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7. Pao =55:5 к = х5 
=55:5 +3:5 
2-59 

This means that 80% of the pupils іп this examination 
scored marks below 59. 

In some scaling procedures it is an advantage to look at 
percentiles and their corresponding scores the other way 
round and to find the percentile corresponding to a selected 
score, instead of the score for a selected percentile; for 
example, to find from grouped data the percentiles corres- 
ponding to the upper limits of class intervals 25-5, 30:5, 35-5 
and so on. This can be very quickly done for the whole 
series of upper limits if a column of percentage cumulative 
frequencies is added to Table2. From the Table we see that 
there are 21 scores out of a total of 36 below the value 50:5; 
stated differently 58-3 per cent. of scores lie below 50:5 ог 
50:5 = Pss.s 

(d) Standard Deviation 

The most useful measure of the scatter of marks is the 
standard deviation. It is often denoted by the Greek letter 
с. Ап example is perhaps the best way by which to illustrate 
what the standard deviation really is. Take the following 
marks scored by ten pupils, 84, 78, 72, 60, 56, 50, 49, 47, 43 
and 41. The mean score is 58. Тһе deviations of each score 
from the mean (the difference between each score and the 
mean) аге 26, 20, 14, 2, -2, -8, -9, -11; -15 and -17. 
These deviations are squared to give 676, 400, 196, 4, 4, 64, 
81, 121, 225 and 289. The sum of the squared deviations is 
2,060 and therefore the mean squared deviation is 206. Тһе 
square root of this is 14-3 and is the standard deviation. 
From the above example it can be seen that the standard 
deviation is the square root of the mean of the squares of the 
deviations from the mean. i 

The calculation of the standard deviation for a frequency 
distribution with any class interval is given in Appendix III. 

The standard deviations for marks A, B and C are 12-1, 12-1 
and 8:6 respectively showing that the scatter of marks given by 


с 
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teachers A and B is the same and each is greater than that 
given by teacher C. 
THE NORMAL FREQUENCY DISTRIBUTION 


We have shown how distributions of different kinds are 
most suitable for examinations of different types. There is 


one frequency distribution which is of great importance; . ` 


it is called the normal frequency distribution. As an 
American has said, it is the lathe on which the statistician 


makes most of his tools. Take the following distribution of - 


IQs:— 


TABLE 3 
Normal Distribution of 105 


Class Interval | 55- | 65- | 75- | 85- | 95- | 105- | 115- | 125- | 135- 
с 64 |74 |84 |94 |104 | 114 | 124 | 134 | 144 


Frequency 1 4 11 21 26 |21 11 4 1 


Тһе histogram of this distribution would be as shown іп 
Fig. VI. : 

If a curve is drawn through the mid points of the top of each 
box, it will give a well-shaped symmetrical curve known as the 
normal curve. The curve is symmetrical about the mean of 


Бю. VI 
Histogram of Normal Distribution of IQs 
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55-64 65-74 75-84 85-94 95-104 105-114 115-124 125-134 135-144 
IQs 
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the distribution. Тһе second feature of the normal dis- 
tribution is that the scores concentrate closely around the 
mean and taper off equally on either side; there is:a slow 
decrease in the frequencies for a certain distance above and 
below the mean, a quicker decrease for a bit and then a slower 
decrease to the limit on both sides. (See Fig. VII). 


Measuring from the Mean 


Perhaps we should indicate at this point that the statistician 
finds it convenient to measure from the mean and not from 
zero. Тһе reason for this is evident if we consider finding the 
mean of the following numbers—86, 89, 90, 92, 94, 95. By 
adding them and dividing the total, 546, by 6, we find that the 
average is 91. It is much simpler, however, to subtract 90 
from each, transforming the numbers into —4, -1, 0, 2, 4, 5, 
giving a total of 6 and a mean of 1. When 90 is added again 
the mean would be 90--1-91. This device is often used 
unconsciously. Golfers, for example, often count their 
Scores by measuring from their probable average number of 
strokes for each hole; two above 4s for nine holes would 
mean a score of 38. 

Another device adopted by the statistician is measuring 
from the mean in units of the standard deviation or sigma 
units, as they are called. For example, if the mean IQ of 
the group of children is 100 and the standard deviation is 15, 
an IQ of 130 would be +2c in sigma units because 130 is 
30 above the mean, which is +290, thatis +20 in terms of the 
standard deviation of 15. Similarly, an IQ of 80 15 – 1:33с in 
sigma units. 

In the diagram below, the distribution of IQs is shown in 
Steps of 15; below the line are shown the corresponding 
sigma measurements from the mean 0. 


35 70 85 100 115 130 145 
- 30 – 20 —с 0 +o +20 +30 


Figure VII shows how the area is divided up at 10 
intervals, correct to the nearest percentage. 
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Relationship between the Standard Deviation and the Area 
under the Normal Curve. 


Fic. VII 
Area under Normal Curve 


-5с -2с Че“ 0с- *lo- *2c +30 


For the purpose of this book it is convenient to think of the 
percentage of the total area as being cut off by an ordinate that 
moves from left to right. For example, the ordinate at -20 
cuts off approximately 2% of the area, the ordinate at — 1c 
cuts off 16% (2% -- 596 +9%) of the area, and so оп. If we 
think of Fig. VII as representing the distribution of 100 scores 
in perfectly normal fashion, then there are two scores (out of 
100) below -20. Another way of stating this is that the 
second percentile (P2) corresponds to -2о, Р to –1а, 
P50 to 0, Рз; to + 1e, and so on. 

If a normally distributed set of marks had a mean of 50 and 
a standard deviation of 10, then P, would correspond to 30 
marks, Р to 40, Ра to 50 and Ps: to 60. 

We have shown the relation between sigma scores and 
percentiles; for example, Ps4 corresponds to +1о. Тһе 
statistician has constructed tables giving all the percentiles 
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| іп terms of sigma units. One such table is given in Appendix 
| IV from which the following figures аге extracted :— 


Standard 
Deviation Percentile 


y ‘There are some figures in this table to which special 

attention must be drawn: – 1с cuts off approximately 16% 

| of the scores from the bottom, and +1е cuts off 16% of the 

| scores from the top. If we can find the two scores that are 

16% from the top (Pss) and from the bottom (P16) respectively 

| and subtract them, we get the range of scores that is equal to 
| twice sigma. 

For our benefit the statisticians have prepared another 

table which shows the sigmas and percentiles the other way 

| round (Appendix V). From this table the following figures 


\ һауе Бееп ехїтасїей:— 
| 

| Standard 
Percentile Deviation 
1 —2.326 
5 — 1.645 
10 — 1.282 
| 50 0:000 
| 70 4-0:524 


If we know Фе mean and standard deviation of a normal 
distribution, we can calculate the scores corresponding to any 
percentile from Appendix V. For example, if in a normal 
distribution the mean is 50 and the standard deviation is 10, 
then 


Ps =50 -10 x 1:645 
-33:55 
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Тһе reader is invited to check the following figures:— 


Рі =26-74 Pop =73-26 

РЕ 598325 Pos =66:45 

Pio =37-18 Poo =62:82 | 

P2 -41:58 Рао = 58:42 

Рао -44:76 Ро-55:24 

Ро =47:47 Poo =52:53 
Pso =50-00 


If ordinary graph paper is used with the horizontal axis 
denoting percentiles and the vertical axis scores, we get a 
curve known as an ogive. (See Fig. VIII). 

If specially ruled paper* known ав Permille ог 
Arithmetical Probability Paper is used the percentiles do not 
form a curve but a straight line. 

To draw this line, of course, only two points are required. 
We need hardly point out the great saving in time and labour 
achieved by using Permille paper. To draw the normal 
line we might calculate Ра and Раб and draw the line 
joining these two points. (See Fig. IX). 

When the distribution is not normal, percentiles calculated 
in the usual way and plotted on this special paper will not be 
on one line: the points will form when joined what is known 
as a zig-zag (see Fig. X). 

Тһе table on page 28 gives in percentile form the distribu- 
tion of IQs in a secondary school. i 

Тһе zig-zag is shown in Fig. X. 

If the best fitting line is drawn among these points and the 
IQ values of Ps, and Pi; are read off, we have a good 


* Data Sheet No. 37. Arithmetic Probability. (Wightman Mountain, Ltd., 
Artillery House, Artillery Row, Westminster, S.W.1). 


Data Sheet No. 37. Permille Paper. (Hunt & Broadhurst, Ltd., Ideal 
Works, Botley Road, Oxford). 


This is a form of graph paper in which the vertical lines are spaced, from 
the central line (P59), at distances proportional to the sigma values of the 
percentiles (Appendix V). For example, Рі is —1o from Ро, Poo is 4-1-286, 
from Ps, The horizontal lines are equally spaced as in ordinary graph 
paper. 
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Percentile 
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approximation to the standard deviation of IQs in the 
School. In this case 


Ра: =115 
Р-95 
Ра — Рав =20 =20 
0 10 
‚ А Ps4 + Рав Ж) 
Тһе mean will be approximately созү = 1(115 +95) 

-1x210 
-105 


This chapter will have attained its limited objective if it has 
put into the hands of the reader such of the elementary tools 
of the statistician as are needed for scaling and has shown him 
how to handle them. 


СНАРТЕВ ТУ 
FROM PHYSICAL ТО MENTAL SCALING 


SIR CYRIL Burt has stated the difficulties in interpreting 
examination marks as follows:— When we turn from 
physical measurements to those of mental capacities or 
attainments a difficulty confronts us. In comparing 
physical magnitudes, we assume that all observers are using 
the same scale: if a French doctor takes a temperature 
with a Centigrade thermometer while his English colleague 
records the result in Fahrenheit, we make the necessary 
conversion before the two readings are compared. 
Similarly, in dealing with marks obtained in an examina- 
tion we must also take into account possible differences of 
Scale."* The two doctors know their respective zero-points 
and units. Two examiners, however, wishing to compare 
marks have no fixed points on their scales as the doctors 
have; they have to devise methods of inferring these from 
the whole run of the marks. . Once they have found their 
zero-points and units they can convert in the same way as 
doctors do with temperatures. Let us digress for a little 
to see how the doctors might have done their conversions. 

The quickest way would be to consult one of those 
thermometers that have the two scales lying on opposite 
sides of the mercury column. In the absence of such a 
thermometer or a wall chart or a ready made conversion 
table, they might fall back on the formula they had learnt 
at school, The formula forgotten, they would devise a 
Solution in some such way as this: they would note (1) 
that the interval of temperature between the freezing point 
and boiling point of water is divided into 100 units on the 
Centigrade instrument and 180 on the Fahrenheit, and (2) 
that the effective zero-point of the Centigrade scale is 0 
and the Fahrenheit 32. 


* HARTOG, RHODES & Burt. Marks of Examiners. (Macmillan & Co., 1936). 


29 


30 THE SCALING OF TEACHERS МАВК$ 


Boiling Point 
100*C. 2122Е, 
Centigrade Fahrenheit 
Scale | Scale 
SURG е slt M ? А 
0°С. 32°F, 


Freezing Point 
If they wish to convert 30° Centigrade into the correspond- 
F-32 30-0 

180 100 

each expression being an equal fraction of the interval 
between the boiling point and the freezing point, the 
Big Unit of Temperature or B.U.T. as we may call it. 
By simple algebra:— 

180 


кы Paap Ж Уа 
100 ~ 


ЈЕ -2 х30 +32 
=86 
In'effect they would be using the well-known formula— 


9 
Е ==С +32 
< + 


ing Е degrees Fahrenheit they would write 


where C is the temperature in degrees Centigrade. 

This formula is an instruction to the English doctor: ,to the 
zero of the Fahrenheit scale (32) add the Centigrade 
reading stretched out in the ratio 9:5. 

The formula is all very well for the odd occasion; if 
several conversions have to be made at the same time a 
table or graph is much speedier. 

To construct a conversion table substitute in the formula, 


E -5С +32, values of Centigrade from 0 to 100. 


ЕКОМ 
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Centigrade Fahrenheit 


32 
33.8 
35.6 


212 


31 


The graph is constructed by drawing the straight line join- 


ing the points (0°C, 32°F) and (100°C, 212°F). 


To convert 


30°C. to Fahrenheit, follow the vertical AB till it meets the 
graph at B and the horizontal line BC till it meets the 
Vertical axis at C which gives the Fahrenheit reading. 


Fic. XI 


Converting Centigrade to Fahrenheit by Graphical Method 
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It would have complicated matters for the doctors if a 
Russian had been present with his Réaumur scale (freezing 
point 0, boiling point 80). Figure XI enabled us to convert 
from one scale to another. Тһе three doctors would have 
required three conversion graphs to compare their temper- 
atures; obviously it would be much more convenient to 
compare all three at the same time. This can be done if 
there is some common scale against which each can be 
measured. There is a common scale at hand—that based 
on the interval between the boiling point and freezing point, 
the B.U.T. Figure XII below shows the conversion chart. 
Conversion can be made up any vertical line; for example, 


Fic. XII 
Conversion Chart for Fahrenheit, Centigrade and Reaumur 
Temperature 
240 
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40°R. corresponds to 50°С. and to 122°F. Again 80°С. 
corresponds to 62°R. and 176°F. 

Had a fourth doctor come along with yet another scale, 

freezing point 50 and boiling point 150, he could have been 
accommodated on our diagram and his readings scaled to 
that of any of his colleagues. 
» There is, however, a kind of thermometer that cannot be 
treated in this way. Let us imagine a “Hottentot " doctor 
who scales his own thermometer without taking into ac- 
count the boiling point and freezing point of water. He 
simply divides his tube into 100 equal divisions, putting 0 
just above the bulb and 100 near the top. His instrument 
will be quite useful up to a point; he can go round his 
hospital registering the temperature of each patient, and so 
put them in an “order of merit" of temperature. After 
finishing his round he can examine his layout or distri- 
bution of temperatures and come to the conclusion that 
certain patients have temperatures so much above the 
average, or so far below the average, that they require 
special attention. He would find possibly that, say, 56 was 
the average. To complete the illustration, let us suppose 
that the “ Hottentot”? doctor throws away his ther 
mometer each night. With a new thermometer of a differ- 
ent bore and size of bulb, he will have to go through the 
same performance again, carefully examining his distri- 
bution of temperatures in order to single out each day the 
dangerous cases. His normal reading will be quite 
different except by chance. 

The story of the “Hottentot” doctor is a near parable of 
the procedure which the teacher must adopt, willy nilly, 
in marking his papers. Не may construct an examination 
paper in arithmetic with his “zero” at 0 and his “тпахі- 
mum?" 100 and, just as the “Hottentot”? doctor can put his 
patients in order of merit of temperature, the teacher can 
arrange his pupils in order of merit in arithmetic noting 
Which have a high and which have a low achievement. 
He, too, might infer that 56 was a normal or average mark 
but a mark of 56 in itself will be meaningless to teachers of 
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any other subject. The next day he might set another 
examination in arithmetic. Again he will have to scrutinise 
the marks to find out the average mark. If the new average 
were found to be 56 it would be purely a chance occurence. 

The English, French and Russian doctors could 
convert their readings because they had a common scale, 
the B.U.T., against which they could compare their 
temperatures. Teachers of arithmetic, English and 
mathematics in the same school can only compare their 
marks if there is some common scale or objective standard 
against which their marks can be measured. In the same 
way a Director of Education must have some objective 
standard with which he can compare, convert or scale the 
marks in arithmetic from different schools. 

Тһе educationist, like the scientist, must devise a formula, 
table or graph to convert or scale marks so that they have a 
meaning not only for himself but for others. 


| СНАРТЕВ У 
THE SCALING PROCEDURE 


| WE һауе seen that an examination mark by itself has по 
theaning; the standard of marking and the scatter of the 
marks must be known, that is, the mean and the standard 
deviation of the distribution of the marks must be known. 
| The fundamental principle in scaling is to make the 
| standard and scatter of marks іп one set of marks the same 
| as the standard and scatter in another set. For example, 
teachers' estimates in arithmetic from a number of schools - 
may be scaled on an external test in arithmetic with the 
results that the mean and standard deviation of the esti- 
mates in each school become the same as the mean and 
standard deviation of the externaltestscores for that school. 

No attempt will be made to establish the formula for 
scaling a series of marks but a parallel may be drawn with 
the conversion of Centigrade into Fahrenheit temperatures 
which will give the non-mathematical reader some idea of 
its derivation. 

When a Fahrenheit thermometer and a Centigrade 
thermometer are placed in the same dish of water, they give 
different readings because they employ different scales. 
It was shown in Chapter IV that these two readings are 
related to each other by the equation. 


Е-32 EZO 
180 100 
or Е -2С +32 


i This equation was derived from a knowledge of the fixed 
points and units of each scale. 

In examination marks our fixed point is the class average 

and our unit the standard deviation of the class marks from | 

the average. When marks are converted to this scale, we 
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can compare marks in different examinations, or we can 

compare examination marks with teachers’ estimates. 
Suppose, for example, that a pupil has scored 58 in an 

examination where the mean was 51 and the standard 


сз БОЛ -0-5. Sup- 


pose that in another examination applied to the class he 
scores 78, the average being 70 and the standard deviation 
78 -70 


deviation 14. His standard score is 


16. His standard score is =0:5. Our conclusion 


is that his performance is equally good in each examination. 
To generalise the numerical argument, let us denote the 
mark obtained by our candidate in the first examination by 
X, the average mark by X, and the standard deviation by 
‘cx; similarly Y, Y and cy represent the mark, average 
mark and standard deviation in the second examination. 
Then the numerical relation 
58-51 78-70 
Wi по 
X-X Y-Y 
ох Di сү 
This equation tells us what mark X in the first examina- 
tion corresponds fo the mark Y іп the second examination 
and vice versa. To find the mark X corresponding to a 
given Y we should write the equation in the form 


becomes 


X =X +2 (у y) 
os 


We can then calculate for each value of Y the cor- 
responding value of X. This is called scaling Y on X, and 


the equation is called the scaling equation. “Х may be 


n H pod 
termed the scaling ratio or the adjusting ratio. 
If this operation were carried out on the marks previously 
mentioned, where the average was 70 and the standard 


deviation 16, the new or “scaled” marks would have an 
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average of 51 and a standard deviation of 14. This is the 
essential point about scaling; it is an operation for chang- 
ing the average and standard deviation of a set of marks to 
a new average and standard deviation. 

One important use of this operation is in scaling teachers' 
estimates on the marks obtained by the class in an external 
examination. The need for scaling these estimates was 
pointed out in Chapter П. Опе way of scaling them is to 
give them the same average and standard deviation as the 
marks obtained by the class in an external examination in 
the subject for which the teachers' estimates were prepared. 
Тһе Scaling Equation will be:— 


Ts Mg -2ҚТ- My) 
Qon 


where Та =Scaled teachers’ estimate. 
My = Mean test mark. 
сұ =Standard deviation of marks. 
ст =Standard deviation of teachers’ estimates. 
T =Teachers’ estimate. 
М, = Меап of teachers’ estimates. 


It will be obvious that the estimates for a class which has 
performed well in that examination will be adjusted to a 
corresponding level of excellence and that any tendency 
of the teacher to bunch estimates will be counteracted. A 
point that may not be clear at first sight, but which will 


TABLE 4 


Estimates and Marks in English for 20 Pupils 
= 


Examination Examination 
Pupil | Estimate Mark Pupil | Estimate Mark 
A 40 47 K 48 56 
B 45 54 Т; 70 72 
(е: 65 66 м 45 49 
р 50 41 ч 45 56 
Е 60 #59 о 60 78 
Е 65 74 Р 40 43 
G 80 84 Q 60 66 
H 35 40 R 35 32 
I 42 49 5 40 41 
1 45 54 T 60 65 


E 
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become clearer when the following examples аге studied, 
is that scaling does not alter the order of merit of the 
pupils as given by the teacher: the estimates are adjusted 
in size but the final order in the scaled estimates is exactly 
that of the original estimates. 

There are given in Table 4 the teachers’ estimates of 
attainment in English for each of the 20 pupils in a class, 
and the score made by that pupil in the external examina- 
tion in English. These estimates will be scaled on the 
marks by two methods. 


I. ARITHMETICAL METHOD (SIGMA SCALING) 
From Table 4 above, the following are calculated. 
(Note:—the full calculation is given in Appendix VI). 

Average of teachers’ estimates =51-5 

Average of examination marks = 56:3 

Standard deviation of marks 1412 
Standard deviation of estimates 

If the unscaled estimate is T, and the scaled estimate Ts; 
the scaling equation is 

Ts =56:3 +1-12 (T - 51:5) 

The application of this equation to the unscaled estimates 
will produce scaled estimates with an average of 56:3 and 
the same standard deviation as that of the marks. 

The equation, when simplified, becomes:— 

` Ts =1-12T - 1:4 

On applying this equation to the estimates given above 
we obtain the scaled estimates shown in the fourth column 
of Table 5 below. For example, the first estimate 40 
becomes 1:12 x40 – 1:4 =43 to the nearest whole number, 


Possibility of negative scaled estimates 


Suppose in another example the equation was: 
Ts =1-5Т -21 
А teacher's estimate of 14 would give a scaled mark of 0 
and all estimates below this would result in negative scaled 
estimates. For example, a teacher's estimate of 10 would 
give Tg = -6. 


Scaling ratio — 
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While a negative mark would not disturb the mathe- 
matician, the normal class teacher would hesitate to award 
a pupil a mark of - 6 for English. 

The occurrence of negative marks is one of the objec- 
tions to the method of scaling by means of the scaling 
equation. 


П. First GRAPHICAL METHOD. 


McClelland in his “Selection for Secondary Education" 
gives a method which avoids the creation of negative 
marks. This system may best be illustrated by the 
example which we have used above with the scaling 
equation. The vertical axis denotes teachers’ estimates 
and the horizontal axis the external examination marks. 
Four points are plotted :— 


Fic. XIII E 
Graph for Scaling Teachers Estimates. 
90 Т ЕН 
во E- ЕЕН TH 
HEH 
70 m 
8 EH р E 
B 60 Ez 
ui s ЕН 
5 Н 
40 
Ні 1. 
1 
30 40 50 60 70 80 90 


Marks апа Scaled Estimates 
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L—the lowest estimate (35) plotted against the lowest 
mark (32). 

Qi—the lower quartile of the estimates (41) plotted 
against the lower quartile of marks (45). 

Qs—the upper quartile of the estimates (60) plotted 
against the upper quartile of marks (66). Я 

H—the highest estimate (80) plotted against the 
highest mark (84). 


The line LQiQsH shows the approximate relation 
between estimates and marks for theclass. Тһе process of 
scaling consists in forcing the estimates into general 
agreement with the marks. Тһе horizontal axis refers 
both to marks and to scaled estimates. An estimate P is 
scaled by following the line PR, RS to obtain the scaled 
estimate S. 


The scaled estimates shown in the fifth column in Table 
5 are obtained by this method. 


TABLE 5 
Estimates for English Scaled by Calculation and by 
Graphical Method 
Scaled estimate 
pO A "ӨЙ 
Pupil | Mark Estimate |Calculation Graph 
A 47 40 43 43 
В 54 45 49 49 
с 66 65 71 70 
р 41 50 55 55 
Е 59 60 66 66 
Е 74. 65 71 70 
с 84 80 88 84 
н 40 35 38 32 
1 49 42 46 46 
7 54 45 49 49 
к 56 48 52 53 
І, «72 70 77 75 
м 49 45 49 49 
N 56 45 49 49 
о 78 60 66 66 
ip 43 40 43 43 
Q 66 60 66 66 
R 32 35 38 32 
S 4l 40 43 43 
т 65 60 66 66 
жат а т == =! к O 
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One disadvantage of this method is that ап isolated very 
low—or very high—score or estimate exerts an unduly great 
influence on the scaled estimates in that quarter of the 
graph. The method is a rough form of a more precise 
method which will now be described. 


III. SECOND GRAPHICAL METHOD (PERCENTILE SCALING) 


We use the following example to illustrate this more 
precise method of scaling. 

An external test in English was set to several schools 
presenting candidates for the Senior Leaving Certificate. 
The results of this test were used to scale the teachers’ 
estimates from the various schools. Table 6 gives the 
frequency distribution of the scores and estimates from 


school E. 
TABLE 6 


Frequency Distribution of Scores and Estimates 
in English in School E 


Class Test 
Interval Scores Estimates 

85-89 — - 
80-84 = - 
75-79 6 — 
70-74 7 5 
65-69 8 7 
60-64 10 10 
55-59 9 15 
50-54 8 15 
45-49 6 5 
40-44 3 3 
35-39 2 - 
30-34 1 -- 
Total 60 60 


As the test was set to all schools it gives a common 
standard against which to compare the estimates. Тһе 
principle involved in this scaling procedure is that two 
Scores of the same percentile standing are equivalent; for 
example, a pupil who has a Percentile Rank of 70 in the 
estimates is given, as his scaled estimate, the score of the 
pupil who has a Percentile Rank of 70 in the Test. In this 
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way the estimates are forced to conform to the same 
standard and scatter of marks as those of the scores made by 
pupils in that school. Corresponding scores are read off 


from а graphical picture whose construction we detail 
below. 


TABLE 7 


Calculation of Percentiles Sor Upper Limit of Class Intervals 
of Scores and Estimates in English in School E 


Test Scores Estimates 

1 2 3 4 5 6 
Class 
Interval i c.f. % c.f. f. c.f. ее СНВ 
75-79 6 60 100 
70-74 7 54 90.0 5 60 100 
65-69 8 47 78.3 7 55 91.7 
60-64 10 39 65.0 10 48 80.0 
55-59 9 29 48.3 15 38 63.3 
50-54 8 20 33.3 15 23 38.3 
45-49 6 12 20.0 5 8 13.3 
40-44 3 6 10.0 3 3 5.0 
35-39 2 3 5.0 
30-34 1 1 1.7 


Step 1—Calculate the cumulative frequencies (c.f.) for test 
Scores and teacher's estimates—columns 2 and 5, Table 7. 
Step 2—Change these into percentage cumulative fre- 
quencies (% c.f.)J—columns 3 and 6. (Columns 3 and 6 


give the Percentile Rankings of the upper limit of scores in 
the interval). 


Step 3—Draw a graph formed by plotting the points, 
percentile against test score (1-7, 34-5; 5:0, 39-5, сіс.). 

See Graph А. Fig. ХІУ. 

Step 4—On the same sheet draw the graph formed by 
plotting percentiles against estimates (5, 44-5; 13-3, 49-5, 
сіс.). 

See Graph В. Fig. ХІУ. 

Step 5— Convert teacher's estimates into scaled estimates 
by finding on Graph A the mark which has the same 
percentile standing as the estimate on Graph B. For example, 
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an estimate of 61 on Graph B and the corresponding point 
on Graph A each corresponds to a score of 66. Each of 
these marks has a percentile standing of 69. 

Instead of using ordinary graph paper Permille paper might 
have been used (Fig. XV). It will be noted that almost 
identical results will be obtained. 

In practice to save constant reference to the graphs and 
possible error, a conversion table would be constructed, 
Showing the scaled estimate for each raw estimate. The 
scaled estimates are read off to the nearest whole number. 


TABLE 8 


Extract from Conversion 
Table for School E 


CHAPTER VI 


SCALING TEACHERS' ESTIMATES—SOME FURTHER 
POINTS 


Tw the previous chapter methods of scaling teachers' estimates 
were described. In this chapter it is proposed to discuss the 
problem in some of its wider aspects. 

Perhaps the best known use of the scaling procedure is in 
schemes of transfer of pupils from primary to secondary 
education where the teachers are asked to submit estimates in 
English and arithmetic which are scaled on the results of an 
external test or examination in these same subjects. The 
Scottish Advisory Council on Education have outlined a 
scheme whereby the award of the Leaving Certificate may be 
based on teachers' scaled estimates. 

Care must be taken that marks from the same school have 
thesamesignificance. For example, a large school with three 
classes in English being presented for the Leaving Certificate 
will probably have three different teachers engaged in teaching 
these pupils. If all three teachers submit their estimates with- 
out any attempt at co-ordination the marks may as well be 
considered as coming from three different schools and must be 
scaled separately. If, as is desirable, a single scaling is to be 
made for the school one teacher or a team of teachers must be 
made responsible for co-ordinating the marks from the school. 

Secondly, in selection for secondary education it has been 
found better to scale subject on subject; that is, estimates in 
English should be scaled on marks in English, and arithmetic 
estimates on arithmetic marks. It is economical of time and 
labour to scale the total estimate in English plus arithmetic 
on the total mark in these subjects, but the findings of the 
McClelland investigation were that the predictive power of 
the estimate was weakened by this merging of data. 

/ Тһе suggestion has been made that teachers' estimates in 

English and arithmetic might be scaled on intelligence 
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quotients. The attraction of this suggestion lies in the en 
that it renders the external examination unnecessary for 
scaling purposes. The results of the investigation just, 
mentioned show that a great deal of the predictive power of 
the scaled estimate is lost if this course is adopted. Ав will .« 
readily be seen, the method would not allow for differences in 
the attainment of a class in English and in arithmetic. 

Thirdly, there is a limit placed on the feasibility of the 
scaling process by small numbers in a school. The scaling 
calculation previously described can be made when there are 
only a few candidates, but the assumptions underlying the 
procedure would no longer hold. The scaling methods 
described yield trustworthy results in general only when the 
number of pupils in the group is greater than sixteen ог, рее 
still, twenty. If the number is smaller, it may still be possible | 
to scale successfully when the estimates and marks fulfil 
ceríain conditions, but this is a matter still open to experi- 
ment. In our Chapter XI on “Scaling in the Classroom," 
we show a method which gives at least a tentative solution of 
this problem. 

It might be profitable to draw attention to the help that 
scaled estimates can give to a teacher. If estimates are scaled 
on standardised marks of the type to be described in 
Chapter VIII, and the scaled estimates are made available, as 
they ought to be, to the teacher concerned, he can at once see 
in what respect he has been successful or otherwise. Fora 
teacher fresh to the work of the particular stage of primary or 
secondary education, this information might prove very 
valuable. There is the hope, too, that further practice, with 
knowledge of the results at each attempt, will enable teachers 
to prepare more accurate estimates each year. 


THE DIFFERENCE BETWEEN SIGMA AND PERCENTILE SCALING 


It is with some reluctance that we introduce this topic as a 
full consideration of it would demand from the reader 
statistical knowledge beyond that given in the elementary 
outline in Chapter III; notwithstanding, we feel that, for 
completeness, it is necessary to refer to the matter briefly. 
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Readers who have closely followed the examples and who 
have possibly worked out examples of their own will have 
observed that the results obtained by different methods of 
scaling may not be exactly the same. In part, the differences 
are caused by the approximations made in reading graphs, 
and in rounding off figures in calculations, but there is a more 
fundamental difference between the sigma, or equation, 
method and the percentile method. 

With the sigma method, the raw estimates or marks are 
made to conform with the marks in the external examination 
in two respects only, namely, in average and in scatter. 
Suppose, for example, that a set of estimates is distributed in 
either form shown on page 15 and that the external marks, on 
which they are to be scaled, are distributed normally. The 
use of sigma scaling will alter the average and scatter of the 
estimates but will preserve the lopsided character of the 
distribution. With the percentile method, the raw estimates - 
or marks are made to agree with the external marks in all } 
respects; the distribution of scaled estimates becomes identi- 
cal with that of the external marks on which they are scaled. 
In the above example the use of percentile scaling will remove 
the lopsided nature of the estimates and produce scaled 
estimates which are normally distributed, as were the external 
marks, and have the same average and scatter as these marks. 

The choice of scaling method is determined partly by the 
nature of the estimates or marks provided, and partly by the 
distribution of scaled estimates or marks desired. For the 
promotion stage, we favour the use of the sigma method 
on the grounds that teachers’ estimates usually require ad- 
justment only in standard and scatter, being distributed in 
passably normal fashion and that is the form in which we wish 
to have the scaled estimates. At the Leaving Certificate 
stage, on the other hand, the percentile method is recom- 
mended because the teachers’ estimates are, for various 
reasons, far from normal and because there is a cluster about 
the 50 mark which must be removed to produce scaled estim- 
ates giving greater discrimination at that critical point. 


СНАРТЕВ УП 


ТНЕ STANDARDISATION OF EXAMINATION 
MARKS IN PRACTICE 


Ir has been shown that the distribution of examination marks 
should depend on the purpose of the examination. In 
examinations used for selection for secondary education it is 
preferable, though not essential, to have normally distributed 
marks. The word “normally” is here used in the sense of 
Chapter Ш. One reason is that these marks are often used 
along with intelligence quotients, and these, when the 
'candidates are unselected and reasonably numerous, are 
usually distributed normally. 

The marks obtained in examinations in English are fre- 
quently distributed almost in normal form, but in arithmetic 
it is sometimes found that the distributions are far from 
normal. It is possible to transform these "raw" marks into 
“standardised” marks which are normally distributed. What 
is more, the mean and standard deviations of the new marks 
can be adjusted to suit the examiner’s requirements. This 
process of transforming raw marks, so that they (i) become 
normally distributed, (ii) have an agreed mean, and (iii) have 
an agreed standard deviation, is here called “standardisation.” 

In the example which follows, the agreed mean has been 
fixed at 65 and the standard deviation at 15. These figures 
have been chosen for the following reasons:— 

(i) Intelligence quotients (as obtained from Moray House 
tests) are standardised to have a standard deviation 
of 15. 

(ii) The number of scores above 100 in a normal distri- 
bution with this mean and standard deviation is very 
small; this fits in with the usual conception of 100 as 
the maximum of the scale. Іп practice there may be a 
few marks over 100. 
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(іі) The number of Scores below 50 is about 16 per cent. 
of the total group, which again agrees roughly with 
the usual practice of teachers. 


It must be made clear that standardisation is an operation 
to be applied to the scores of a whole county or city. Тһе 


can be maintained year after year; a standardised mark of 60 


exactly the same as 60 in 
the following year, provided that the number of candidates 15 
reasonablylarge. А teacher who is asked to provide numeri- 
cal estimates of attainment in English and in arithmetic has 


; She knows or can be informed 


the arithmetic or English sco; 
is 85 (i.e., 120 -35) while that 
Instead of attempting an e 


те appropriate to an IQ of 120 
for an IQ of 90 is 55. у 

хрјапаноп of the mathematical 
ess, we shall illustrate by an 
experience of standardisation. 
Scores of the 1,379 pupils whose 


below. The desired mean is 65 
and the standard deviation 15, 


EXAMPLE oF STANDARDISATION 

Step 1—Ргераге a table $ 
marks grouped in class in 
vals? and freq 


howing the frequencies of the raw 
tervals of five. These class inter- 
uencies are shown in columns 2 and 3 of Table 9. 


1 The more usual intervals are 0-4, 5-9, 10-14, etc, 
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- Step 2—In column 1 enter the mark dividing each class 
interval from the one preceding it. This mark is denoted by 
the symbol x. 


Step 3—Calculate the cumulative frequencies shown in 
column4. Each gives the number of pupils scoring up to but 
not more than the corresponding value of x. The column is 
built up from the foot—70 -48 +22, 91 =48 +22 +21, and 
so on. 

TABLE 9 
Distribution of Arithmetic Marks of 1,379 Pupils 


(D (2) (3) OR (5) 
Boundary Class Cumulative| 

[63] Interval | Frequency | Frequency |Percentage 

100.5 96-100 86 1379 100 

95.5 91-95 99 1293 93.8 

90.5 86-90 93 1194 866 | 
81-85 79 1101 79.8 
76-80 97 1022 74.1 
71-75 90 925 67.1 
66-70 86 835 60.6 
61-65 98 749 54.3 
56-60 122 651 47.2 
51-55 83 529 38.4 
46-50 86 446 32.3 
41-45 70 360 26.1 
36-40 53 290 21.0 
31-35 45 237 17.2 
26-30 46 192 13.9 
21-25 32 146 10.6 
16-20 23 114 83 
11-15 21 91 6.6 
6-10 22 70 5.1 
0-5 48 48 35 


Step 4—Convert the cumulative frequencies into percent- 
ages—column 5. The percentage of pupils who have scored 
less than 95-5 is now shown to be 93-875, andsoon. Thelast 
column therefore gives a percentile rating for each of the 
boundary marks listed in the first column. 


Step 5—Draw a graph showing the relation between the 
marks and the percentile ratings. From this graph (shown 
in Figure XVI) it is possible to read off the percentile rating of 
any mark not shown in the table, and vice versa. 
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Fic. XVI 
Percentile Curve of 1,379 Arithmetic Marks and Normal Percentile Curve. 
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We must now Prepare a similar graph for a normal dis- 


tribution with mean 65 and standard deviation 15, This is 
done as follows:— г. 


‘score, 7 а ~ 2 
е, i.e., the difference from the mean measure 
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m 6—Prepare the first column of Table 10 by writing 

mean p at intervals of three above and below the chosen 

th оҒ65. Тһе figure three is selected as being one-fifth of 
е standard deviation. 


9 ж + 

tep 7--Еог each mark enter the corresponding standard 
d in units of 
Sta: EE 

Eo rd deviation. For example, 68 is 3 marks or 0:29 
3515 ӨНЕ mean and the standard score is therefore +0:22; 
is -2 marks or 2e below the mean, and the standard score 

Eoo 
TABLE 10 


Percent? ae yp 
ercentiles for Normal Distribution with 
Deviation 15 


Mean 65 and Standard 


Standard Standard 
Mark Score Percentile || Mark Score Percentile 

19 2.6 99.5 62 -02 42.1 
M 24 99.2 59 -04 34.5 
o 22 98.6 56 - 0.6 274 
б 20 97.7 53 -0.8 21.2 
59 1.8 96.4 50 — 1.0 15.9 
86 1.6 94.5 47 — 1.2 11.5 
83 1.4 91.9 44 -1.4 8.1 
80 1.2 88.5 41 -1.6 5,5 
77 1.0 84.1 38 -1.8 3.6 
74 0.8 78.8 35 -2.0 2.3 
2 0.6 72.6 32 -22 1.4 
68 0.4 65.5 29 -24 0.8 
6 0.2 57.9 26 -2.6 0.5 
5 0.0 50.0 23 -28 03 


the percentile rating 


Step 8—From Appendix IV read off 
This is entered in the 


in mE to each standard score. 
ов these percentile г р 
P3 nding marks on the graph previously drawn (Fig. XVD. 
Arst n percentile rating there аге по\ two marks, the 
o paned from the original set of marks, and the second 
Sandisk normal distribution. The second mark is the 
TN ised mark corresponding to the first or raw mark. 
SENT ipe a raw mark of 90 becomes à standardised mark 
» both having the percentile rating 86. The process of 


atings opposite the cor- 
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reading off the standardised marks in this case is shown 
diagrammatically by the arrows. 


Step 10—To save time and possible inaccuracy in repeated 
reading of the graphs, prepare from them a standardisation 
table (Table 11) showing for each raw mark the standardised 
mark. 

It should be noted that although this calculation was based 
on grouped scores, the final table operates for each mark 


TABLE 11 
For Converting Raw Marks in Arithmetic into Standardised Marks 


Raw Std. | Raw Std. | Raw Std. | Raw Std. | Raw Std. 
Mark Mark | Mark Mark | Mark Mark | Mark Mark | Mark Mark 


9 41 
88 80 | 68 68 | 48 57 | 28 47 | 8 40 
E ОО си || Лу mu [Eno Жу о 
86 78 | 66 67 | 46 56 | 26 46 | 6 38 
E 210 6: (6 ЕД5 56 25 46: ||| 5 зв 
84 77] | 64 66 | 44 55 | 24 45 | 4 37 
БЕЗЕ 15 80% 
2 wb Jem. ||) 
ex. mE о КСС а) Ol ү я 

| 0 29 


from 0 to 100. Тһе raw score of each of the 1,379 candi- 


dates sitting the arithmetic examination can now be trans- 
formed to the standardised score. 


STANDARDISATION USING PERMILLE PAPER 


Тһе work of standardisation can be greatly eased by the use 
of Permille paper. This paper, as already stated, is so con- 
structed that the curve of percentile ratings for the normally 
distributed marks becomes a straight line. Table 10 is no 


longer required in its entirety; only two points are required 
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for the straight line. If the reader cares to plot the re- 
maining points in Table 10 on this chart he will find that the 
points fall on the straight line. А further advantage is that 
the conversion at the upper and lower extremes of the scale 
becomes more accurate and much easier. 

The first four steps of the standardisation process remain as 
before, but the work is again made easier at the ends of the 
scale. Тһе subsequent steps аге:-- 

(5) Graph the percentile ratings against their corresponding 
marks, using Permille paper. Join the points by a series of. 
straight lines or by a smooth curve. Note that on this paper ” 
the 100th and Oth percentiles are off the scale. (Figure XVII). 

(6) Calculate, using Appendix IV, the two percentile ratings 
corresponding to the marks one о unit above and one c unit 
below the mean. For example, a mark 80 corresponds to 
4 1e with a percentile rating 84-1 and a mark 50 corresponds 
to -lø and percentile rating 15:9. 

(7) Plot these two ratings against their marks on the Permille 
paper. Draw the straight line passing through the two points, 
extending it at each end. 

The standardised marks may now be read from the graph as 
- before, and a standardisation table constructed. The reader 
will observe that Figures XVI and XVII yield practically the 
same standardisation table. 

In practice the standardisation of examination marks is 
most readily done by using Permille paper and we strongly 
recommend its adoption. 
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CHAPTER VIII 


PROMOTION FROM PRIMARY TO SECONDARY 
EDUCATION 


Тнв Advisory Council in their Report on Primary Education 
outline a scheme for the transfer of pupils from primary to 
secondary education. It is recommended that the main basis 
for selection should be two intelligence tests, standardised 
attainment tests in English and arithmetic, and teachers* 
estimates of attainment in English and arithmetic. This set 
of six scores is sometimes referred to as the “battery” of tests. 

The teachers’ estimates must be “properly scaled so that 


the standard from school to school may be comparable.” ! 


The final mark, of each pupil is found by taking the average 
of his scores in the intelligence tests, attainment tests and of 
the teacher's scaled estimates. This mark is known as the 
summary mark. 


THE WEIGHTING OF TESTS IN THE PROMOTION BATTERY 


ТЕ the scores in these tests are added in their "raw" state, 
each test has a weight proportional to its standard deviation. 
Thus, if the English and arithmetic marks have standard 
deviations of 10 and 15 respectively, the arithmetic mark 
has one and a half times the weight of the English mark 
when these are combined. McClelland found in his investi- 
gation that it was impossible to state exactly the weight that 
should be given to each of the tests in the battery. He re- 
commended that equal weight should be given to each. In 
practice, not a great deal of difference is made if the raw marks 
of the tests are used, provided that the standard deviations of 
the tests are not too greatly different, but the following 
method gives the tests exactly equal weight. 

Intelligence tests of the Moray House type are constructed 
to give a standard deviation of 15 for an age group. If the 
tests in English and in arithmetic are each standardised, by 
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the process described in Chapter VII, to have the same 
standard deviation of 15, then the teachers’ estimates, which 
are scaled on the examination marks, will also have a 
standard deviation of 15 in each of these subjects. All six 
measures have now the same standard deviation, and carry 
equal weight when they are added. 


OUTLINE OF A PROMOTION SCHEME 


A promotion scheme making use of the methods outlined 
above incorporates the following processes. 


(a) The administration of two intelligence tests to each 

pupil. 

The intelligence quotients can be obtained by testing each 
year а two year group, including pupils in Primary ТУ and 
Primary У. This has the additional advantage that any 
indisposition of the pupil is not likely to affect more than one 
test. 


(b) The administration of tests in English and in arithmetic 
to each pupil. 
The administration of the tests in English and arithmetic 
is a normal feature of Scottish educational procedure and 
does not need further explanation. 


(c) The standardising of the examination marks in English 

and arithmetic. 

The method of standardising these marks has been ex- 
plained in Chapter VII. 

(d) The scaling of teachers’ estimates in English and 

arithmetic. 

We favour the sigma method of scaling teachers' estimates 
which has been described in Chapter V. 

(e) The calculation of the pass mark. 

The average of the six me: 
the given pupil. 
with a mean and 
The recommend 
pass mark for a 
be fixed at the s 


asures is the summary mark for 
These sumrhary marks form a distribution 
standard deviation which can be calculated. 
ation of the McClelland Report is that the 
dmission to a senior secondary course should 
ummary mark which is 0-70 times the standard 
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deviation above the mean, and this mark can therefore be 
calculated.* А second and more tentative recommendation 
is that the borderline between the ordinary secondary course | 
and a modified course for the weaker pupils should be about | 
one standard deviation below the general mean. This mark | 
can also be calculated from the summary marks. ! 


THE OPERATION OF А PROMOTION SCHEME 


While it is easy to indicate in a few sentences how each step 
is carried out, the operation of the scheme ina city or county 
is, of course, a much more laborious business. It might 
therefore be interesting to give a more detailed description of 
the process actually employed in a county operating the 
Scheme. 

For each pupil, there was prepared in Primary IV a card on 
‘which were entered the usual particulars, e.g., name, date of 
birth, school. After the first intelligence test the intelli- 
gence quotient (101) was recorded on this card; similarly 
the second intelligence quotient (ТО) when obtained was 
also entered. For both of these entries the headteacher was 
responsible. 

Prior to the examination, the headteacher entered on each 
card the teachers’ estimates in English and in arithmetic. 
These were placed in the boxes marked (a) and (b) in Figure 
XVIII under the heading "raw." The cards were then sent to 
the Education Office where the estimates were recorded in 
summary form, and preliminary steps taken to scale the 
estimates. 

When the cards were returned to the schools and the 
examinations had been held, the marks obtained were entered 
in boxes (с) and (d), under the heading “ raw." The head- 
teacher entered on the card his final recommendation regard- 
ing a suitable course, the parents' choice, and any notes, and 
the cards were returned to the office. 


* This recommendation was based on the results obtained in a city, and on 
the courses being provided there. Тһе Report goes on to suggest that further 
inquiry should be made into suitable passmarks for different types of area and 


ifferent courses. 
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From the distribution of marks obtained from the cards, 
‘the office staff standardised the marks in English and arith- 
metic to a mean of 65 and a standard deviation of 15 
in each subject and entered the standardised marks in English 
(Qe) and arithmetic (Qa) in boxes (e) and (f). Тһе teachers' 
estimates were then scaled on the standardised marks, and the 
scaled estimates in English (Tes) and arithmetic (Tas) 


Fic. ХУШ 
Record of Qualifying Data 
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entered in boxes (в) and (В). These processes were also 
executed in the education office. 


It will be observed that the six measures 101, IQ», Qe, 
Tes, Qa, Tas, thus lay in a column on the edge of the card 
making summation easy, and facilitating also а compari- 
son of the various measures so that discrepancies might be 
investigated. 

The average of the IQs was 100 and that of the remaining 
marks was 65. If the six measures were added and the total 
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divided by six, the resulting mark would lose some of the 
significance that each of the components had. If a figure of 
70, i.e., 2(100 -65) were subtracted from the total before 
division, then the average of the summary marks so adjusted 
was 65. This procedure of subtracting 70 before division 
was therefore adopted, and the resulting summary mark is 
the “adjusted average" shown in Fig. XVIII. 

А similar argument and process could not be applied to 
standard deviations. Each of the six measures had a standard 
deviation of 15, but the standard deviation of the summary 
marks obtained from them need not have that value, in fact it 
is always less. Іп one year in which this scheme was applied, 
the standard deviation of the summary marks was approxi- 
mately 13. The upper pass mark was therefore 65 +0:70 x 13 

=74, and the lower pass mark 65-1 x 13 =52. These figures 

are only approximate, and are quoted to demonstrate that 
the pass marks obtained by this method have a general 
similarity to the percentage marks of the class teacher. 


BORDERLINE CASES 


It might be well to emphasise here that this battery of tests, 
although the best found in the McClelland investigation, does 
not have perfect predictive power. Its findings must not be 
tigidly applied, and borderline cases must receive special 
consideration. The recommendation of the headteacher, 
the wish of the parent and other factors must be taken into 
account before a final decision is made. | 

Тһе next step in the scheme described in this section was 
therefore a study of the cases where the recommendations of 
the battery, of the headteacher and of the parent were not in 
agreement. Although those considering these cases were able 
to formulate some general principles, these were of an experi- 
mental nature, and are therefore not stated here. This field 
15 one which is ripe for experiment. 


СНАРТЕК ІХ 
THE LEAVING CERTIFICATE STAGE 


THE awarding of a certificate to pupils leaving school has 
occupied the minds of educationists both in England and in 
Scotland in recent years. Attacks on the nature of the 
School Certificate in England and the Senior Leaving Certi- 
ficate in Scotland and the method of awarding them have not 
been uncommon. We do not wish to enter the lists either 
for or against the method of awarding these certificates but 
it is clear that any method which relies solely upon the results 
of an external examination is bound to be unjust to many 
pupils. It is equally clear that teachers’ estimates, unless 
they have been scaled, are just as unsatisfactory. We feel 
strongly, however, that the teacher’s order of merit is the one 
which must be accepted and must therefore be taken into ac- 
count in the award of any such certificate. 

As our experience has been in Scottish schools we have 
drawn our illustrations from the Senior Leaving Certificate 
but the principles we lay down are applicable to any external 
examination. 

The Advisory Council on Education in Scotland in their 
report on Secondary Education have suggested new principles 
governing the award of the Leaving Certificate and have out- 
lined а method of examination which involves the use of the 
scaling procedure. A clear distinction must be made 
between the nature of the certificate and the nature of the 
examination. We here are concerned, not with the recom- 
mendations regarding the nature of the certificate, but only 
with the method of examination suggested. 

Briefly, the proposal is to award the certificate on the 
teachers’ estimates which would be scaled on the results of 
objective tests. The estimates in any subject would be scaled 
on the results of 
scaling procedure would be carried out school by school. 
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the objective test in the same subject and the 
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The fundamental assumption underlying this proposal is 
that the teacher's order of merit is most likely to be the correct 
one. Educationists generally will agree that after four or 
five years? experience, a teacher should be able to assess 
а pupil's attainment in a subject much more accurately 
than does a single written external examination. The scaling 
Procedure is introduced only to ensure that a uniform 
Standard of marking is maintained between schools. Scaling 
does not alter the teacher's ranking of his pupils. 


TABLE 12. 


The Raw and Scaled Estimates of 13 Pupils 
Selected from Schools A and B. 


S 5 Е 
caled Estimate Р Scaled Estimate 
School A | Estimate. | School B 

53 47 37 

55 48 38 

56 49 40 

60 50 44 

63 51 48 

64 52 52 

65 53 55 

66 54 57 

68 56 61 

73 58 65 

77 60 69 

81 65 76 

83 70 82 


In Table 12 we give the teachers' estimates in English of a 
number of pupils selected from two schools along with the 
Corresponding adjusted estimates scaled on the results of an 
Objective test in English. The scaled estimates did net alter 
the order of merit although there is an alteration in the size 


of the marks. 
For the Senior Leaving Certificate teachers are asked to 


give a mark of 50 to a pupil who will just pass. This estimate 
Of 50 is therefore a critical mark. 

Let us look more closely at what happens in schools A and 
B to the estimates round about 50. The following diagram 
illustrates what happens to estimates from 47 to 53 in each 
School. 
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Scaling has produced some striking results; for example, а 
pupil estimated at 47 in school A has a scaled mark of 53 
while a pupil with this same estimate in school B has a scaled 
mark of 37. Тһе estimated pass mark of 50 becomes 60 in 
School A and 44 in school B. Teachers will find it difficult 
to believe that such different standards of marking occur. 


Fic. XIX 
L Results of Scaling Leaving Certificate Estimates from Schools A and B 
SCALED ESTIMATE TEACHERS RAW SCALED ESTIMATE 
SCHOOL А. ESTIMATE SCHOOL в 
70 ГО 70 
бО — 60 
50-- 2-50 
40-- = —40 
30 30 30 


The diagram also shows that the order of pupils does not 
alter. For the group of pupils with marks between 47 and 53, 
the pupils with 47 in both schools still have the lowest marks 
while the pupils with 53 have the highest scaled marks. 

It must also be clear that a pupil’s mark in the external 
test does not determine whether that pupil *passes" or 
“fails.” Бог example, in school A the pupil estimated at 58 
scored a low mark, 42, in the test but this test score has no 
effect on the pupil’s rank. It is only used in the calculation 
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of the mean and standard deviation of the test scores for 
school А. Тһе teacher's order of merit or ranking remains 
unchanged and when the pass mark is fixed, all pupils above 
that mark pass and those below fail. Again in school A, the 
pupil estimated at 48 scored a relatively high mark, 50, in the 
test but this does not “boost” his scaled estimate. 

This method of taking the teacher's order of ranking as the 
correct one and adjusting his standard of marking has much 
to recommend it. There seems little justification, for 
example, for the cases which occurred in a school where a 
pupil estimated at 62 failed and another estimated at 38 
passed. Such utter disregard for the teacher's judgment is 
enough to bring any system of examination into disrepute. 
Under no circumstances should a certificate be awarded 
solely on the result of a single written examination. 


THE Pass MARK 


The Advisory Council suggest two methods of fixing the 
pass mark :— 
(a) by a group of inspectors or examiners making a careful 
study of the results of the external test and the teachers 
estimates for a number of representative classes; 


(b) by fixing the pass mark at a certain multiple (to be 
determined by experiment) of the standard deviation of 
the marks of all candidates in the external test from the 
mean of the marks of all candidates in the external test. 


The first method is subjective, depending to a large extent 
On the examiners’ attitude. Research has shown clearly that 
there can be no guarantee that the standard fixed will be 
uniform from year to year by such a method. р 

Тһе experimental methods of fixing the pass mark, while 
Possessing the attributes of objectivity, are by no means easy 
to work out in practice. In selecting pupils for secondary 
education it is possible to carry out a follow-up investigation 
and compare the examination marks at the primary stage with 
the actual achievement in the secondary school. By such a 
method it is possible to find the mark which must be scored 


| 
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before a pupil has a fifty-fifty chance of success in the second- 
ary school. This is not possible at the Leaving Certificate 
stage as the pupils leave school and there is no obvious 
criterion of success. 

The pass mark may. be fixed by determining in advance the 
percentage of pupils who are to pass. It is then an easy 
matter to find the mark which will cut off the fixed percentage 
of successful candidates. Objection to this procedure may be 
taken on the grounds that the calibre of the pupils may vary 
from year to year. This is not likely to occur when the 
sample of candidates is drawn from schools over the whole 
country. Ч 

Technical methods of fixing the pass mark can be justly 
criticised but it must be remembered that the non-technical 


methods can be subjected to the same criticisms and several 
others besides. 


OBJECTIVE TESTS AT THE LEAVING CERTIFICATE STAGE 


Many criticisms have been made of the suggestion that 
objective tests should be used at the Leaving Certificate stage. 
These criticisms have been answered in the Advisory Council's 
report and there is no need to repeat them here. In Ap- 
pendix I we have given examples of experimental types of 
these tests. While it is agreed that objective tests do not 
measure the power to marshall ideas or to assess literature, 
we suggest that the written examination is at best a poor 
measuring rod of these abilities. 

Another objection to the use of objective tests is that 
teachers would coach for them and the broader aspects of 
education would be lost. While it is admitted that the poor 
teacher would be misled into attempting to raise the school 
mean to obtain a large number of passes, we have confidence 
in the profession as a whole that they would still retain their 
ideals and vision. In the second place, with greater experi- 
ence in the construction of these tests, it will probably 
become more and more difficult to coach successfully for 


them; the full possibilities of the Objective tests have not yet 
been realised. 
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SCALING LEAVING CERTIFICATE TEACHERS’ ESTIMATES 


An experiment was conducted in a number of senior 
secondary schools in which all pupils sat an objective test in 
the subjects for which they were presented for the Leaving 
Certificate. We demonstrate the scaling technique from the 
results in English in one of the schools. 

The technique adopted is essentially the percentile method 
shown in Chapter V. The steps in summarised form are as 
follows :— 

Step 1—Find the frequency distribution of the teachers’ 
estimates and objective test scores. 

Step 2—Calculate the percentiles for the upper limit of the 
class intervals. If the scatter of the estimates is not suffi- 
ciently great, additional percentiles may have to be calculated 
to give the line of percentiles. 

Step 3—On Permille paper draw the line of percentiles of 
the objective test scores. 

(See Graph A, Figure XX). 

Step 4—On the same sheet draw the line of percentiles of 
the teachers’ estimates. 

(See Graph B, Figure XX). 

Step 5—Convert teachers’ raw estimates into scaled estim- 
ates by finding on Graph A the marks which have the same 
percentile standing as the estimates on Graph B. For 
example, an estimate of 60 falls on Graph B at L and the 
corresponding point on Graph A is at M which corresponds 
to a score of 71. Each of these marks has a percentile 
standing of 80. 

Step 6—Prepare a conversion table from which can be read 
off any scaled estimate corresponding to any raw estimate. 


SMALL GROUPS 
Perhaps the most serious criticism of the scaling procedure 
is that it cannot be used when the number in the group 15 
small. It must be borne in mind that at the Leaving Certifi- 
cate stage in small schools and in subjects such as Latin, the 


number of candidates presented is generally few. 
Е 
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In Chapter XI we show how the scaling technique can be 
used with groups down to ten pupils. 

Where the number of cases is less than ten, the teacher's 
standard of marking can be assessed to some extent by a 
careful study of the estimates, the test scores and the 
«data on which the teachers have based their estimates. 
Our expenience has shown that some basis for adjusting 
marks is possible when only a few candidates are under con- 
sideration. The relation between estimates and marks in the 
external test is often apparent when the corresponding estim- 
ates and marks are plotted on graph paper. Even with four 
or five marks, the examiner who has acquired an intimate 


Extract from Conversion Table 


Estimates in English 


knowledge of how the scaling operation has worked over a 
large number of schools, can often give a good approximation 
to the alteration which must be made on the estimates. We 
do not propose to discuss this point in detail as our researches 
are still incomplete. We can say, however, that this. neces- 
sarily less precise method of comparing estimates and marks 
is more satisfactory than basing an award on the result of a 
single external examination set to pupils in a small school or 
department of a school. 


FUNDAMENTAL PRINCIPLES 


At the risk of wearying the reader we would draw his 
attention to the following fundamental principles which have 
been demonstrated in the above example:— 


MARKS 
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In Chapter XI we show how the scaling technique can be 
used with groups down to ten pupils. 

Where the number of cases is less than ten, the teacher's 
standard of marking can be assessed to some extent by a 
careful study of the estimates, the test scores and the 
-data on which the teachers have based their estimates. 
Our ехремепсе has shown that some basis for adjusting 
marks is possible when only a few candidates are under con- 
sideration. The relation between estimates and marks in the 
external test is often apparent when the corresponding estim- 
ates and marks are plotted on graph paper. Even with four 
or five marks, the examiner who has acquired an intimate 


Extract from Conversion Table 


Estimates in English 


knowledge of how the scaling operation has worked over a 
large number of schools, can often give a good approximation 
to the alteration which must be made on the estimates. We 
do not propose to discuss this point in detail as our researches 
are still incomplete. We can say, however, that this. neces- 
sarily less precise method of comparing estimates and marks 
is more satisfactory than basing an award on the result of a 
single external examination set to pupils in a small school or 
department of a school. 


FUNDAMENTAL PRINCIPLES 


At the risk of wearying the reader we would draw his 
attention to: the following fundamental principles which have 
been demonstrated in the above example:— 
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(a) the teacher's order of merit is not altered by scaling. 


(b)the pupil's score in the test is used only in the con- 
struction of a percentile graph; it does not determine 
whether the pupil passes or fails. 


(c) the teachers' estimates in any subject are scaled on the 
results of a test or examination in the same subject. 


(d) the scaling operation must be carried out school by 
school. 


TEACHERS’ ESTIMATES 


We should like to draw attention to Graph B in Figure XX. 
It will be seen that it “flattens out" after the mark 50 has been 
reached. This means that there are many pupils with marks 
just at or above the 50 level. At this critical 50 point the 
distribution of marks of 41 out of a group of 104 pupils was 
as follows :— 


Graph A, on the other hand, keeps the same slope all the way 
down. This is a persistent characteristic of Leaving Certi- 
ficate estimates which makes them awkward to scale. 
Teachers tend to “ give a pupil his chance.” If he is lucky 
on the day of the examination he may score above the pass 
mark in the external examination and be credited with a pass. 
In addition, of course, most of the certain failures have been 
excluded from the examination. Scaling will be more accur- 
ate if teachers spread their marks at the lower end of the 
scale and if all pupils at this stage are included. 

At the upper percentiles, Graph A is above Graph B; the 
pupils score higher marks than estimated by the teachers. 
This is another common feature of teachers’ estimates at the 
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Leaving Certificate stage. Teachers tend to err on the safe- 
side and undermark the abler pupils. 

If the scaling procedure were to be adopted, teachers 
would require to give greater care to the distribution of their 
estimates. They would have, however, the satisfaction of 
knowing that greater attention would be paid to these estim- 
ates and that no freak result of an external examination would 
decide the fate of one of their charges. 


СНАРТЕВ Х 


CLASSROOM SCALING—ASSIGNING CLASS 
MEAN MARKS 


A PARALLEL to the following incident must be within the 
knowledge of many teachers in secondary schools where it is 
the habit to allow unadjusted marks to go out to the public. 
Two boys, equally-presentable in manners and appearance, 
applied for a job in a drawing office. One was a boy who 
was on the point of completing his three years’ technical 
course (class mean IQ 110) and who had marks on his 
record card for the third year in the creditable 55-59 region. 
The other was a “C” group (class mean 10 90) boy of small 
natural ability but very hard working. Under the pressure of 
strong parental encouragement he did well among his intel- 
lectual equals. The tests set to his class, based on their own 
schemes, were relatively easy and the marks comparatively 
high; his were among the highest. The conventional high 
marks for “С” pupils and moderate marks for “А” pupils 
were well understood within the school; both were considered 
to act as incentives. Аб {һе drawing office, naturally, both 
sets of marks were taken at their face value and the “С” boy 
was given the job. The result was inevitable: in a short 
time, all concerned, parent, employer and boy, found that 
the work was beyond his powers, 

Clearly the time is ripe for exercising some control over 
the marks to be recorded for “С” pupils so as to bring them 


into some relationship to the marks recorded for pupils of 
known higher ability. 


It would be educational folly to suggest as a method of 


control the setting of a common examination to all classes in 
the same year of study, a practice which can be adopted in 
highly selective senior Secondary schools. In a school with 
pupils drawn from a wide range of intellectual ability where 


classes are following courses suited to the “age, ability and 
72 
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aptitude" of the children, the examination for the “А” pupils 
would be quite beyond the capabilities of the “С” pupils. 
Some system of adjusting or scaling the examination marks 
within the school must be adopted. 

Again, a teacher of a boys' section may have class marks in 
science with a mean of 65 while the teacher of the girls’ 
section, which has the same general ability level, may have a 
class mean of 51. Each teacher may stoutly claim that his 
pupils have “earned” their marks. Some adjustment of the 
marks must be made in this case also if the marks of the two 
sections are to be made comparable. 

The question whether these adjusted marks should be used 
only in the school records or on the pupils’ record cards is 
one for the philosopher and not the scientist. 

Within a school, class marks in the various subjects are made 
comparable by scaling. Comparability is achieved, as before, 
by making the mean and the scatter of the marks of each 
subject equal to some assigned mean and scatter. Two as- 
sumptions are made. Firstly, that class ability in any one 
subject is equal to that in any other; for example, class ability 
in English is assumed to be the same as the class ability in 
mathematics. Secondly, the variation of ability in one 
subject is equal to that in any other; for example, the range of 
ability in English is assumed to be the same as the range of 
ability in science. These assumptions agree with the common 
sense view when the numbers are large; it may be only 
approximately true for the usual school class of 30 to 40 
pupils. Nevertheless, these assumptions are fundamental 
in the application of the scaling process to class marks. 

Scaling school marks is a two-fold problem. In the first 
place we must scale the marks of each class in the various 
subjects; the possible variation in standard and scatter of the 
raw marks for a particular class was illustrated in Chapter II, 
page 11. Before we can scale subject marks we must deter- 
mine what the mean mark for a particular class is to be and 
also what the standard deviation is to be. 

We have assumed that the scatter of marks in each subject 
for all classes is the same and so our second problem becomes 
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the fixing of a mean mark for each class. If in a school there 
are six first year classes, ТА, IB, ІС, 1D, IE and 1F, with 
differentiated schemes of work and examinations to suit the 
various ability levels, which class mean mark is to be 50? 
If class 1D has a class mean of 50 for all subjects, what will be 
the class mean for the other classes? 

This chapter deals with a method for assigning the class 
means and the standard deviations of the class marks. The 
next chapter demonstrates how the scaling is done in practice. 
We denote the class mean mark by the symbol A.M.M. 
(assigned mean mark). 

A. good system of assigning means should have certain 
characteristics which we shall deal with in turn. Firstly, it 
would link the class mean mark in all subjects to some 
criterion indicating the general “brightness” of the class. 
Two. criteria suggest themselves, (a) the mean class IQ and 
(b) the mean achievement mark in a promotion test. The 
mean class IQ is usually chosen as the more convenient index 
of brightness because IQs are on the same scale from year to 
year while actual achievement marks vary with the structure 
ofthetests. There is, of course, a high degree of correlation 
between these two sets of marks. 

Secondly, a good system of assigning class mean marks 
would fix a “Кеу mark” which would have some educational 
significance. For example, on the Scottish National Pupil's Re- 
cord Card there is an instruction that the standard of attain- 
ment reached in each subject will be represented by a percent- 
age mark, *50 being taken to represent attainment that is 
only just satisfactory." Obviously the question immediately 
arises, “ What type of pupil do the framers of this card have 
in mind?" Satisfactory attainment for a pupil in the senior 
secondary school is entirely different from satisfactory attain- 
ment in the junior secondary school. Again, satisfactory 
attainment in the “А” classes of the junior secondary school 
is not the same as in the “С” classes, 

Investigation has shown that a pupil of IQ 110 has a 
fifty-fifty chance of gaining the Senior Leaving Certificate. 
For this type of school a class with a mean IQ of 110 might 
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be given a mean class mark of 50. If such a school had 
classes with a mean IQ of 100, the marks in a class of this 
nature would be depressingly low and would be regarded as 
“unsatisfactory.” 

McClelland’s investigation showed also that a pupil with an 
IQ of 109 had a fifty-fifty chance of obtaining a Day School 
Certificate (Higher). If the mark of 50 were taken as the 
class mean mark for a class with mean IQ 109, the big 
majority of pupils in a Junior Secondary school fall into the 
category “unsatisfactory.” 

The third certificate mentioned in the McClelland investi- 
gation was the Day School Certificate (Lower) where the mark 
50 was intended to mean “ће just satisfactory completion of a 
two years’ approved course." It was found that a pupil with 
an IQ of 94 had a fifty-fifty chance of achieving success in 
this certificate. This gives a good basis on which to build a 
scheme for assigning class means as it was reckoned that 
about two-thirds of the pupils in a junior secondary school 
normally gained this certificate. We shall take as the 
educational significance of a mark of 50 “the just satisfactory 
completion of a three years'* course" and assume that two- 
thirds of the pupils attain this standard. 

The mean IQ ofa large group of pupils selected at random 
is generally assumed to be 100 and the standard deviation 15. 
What shall we take as the class mean mark of a class with 
mean IQ 100? Reference to Table 20 shows that the 
sigma mark which divides a normal frequency distribution 
approximately into two-thirds and one-third is at roughly 
0:52, from the mean. Hence if we make our class mean 
mark for a class with mean IQ 100 at 50 +0-5а, we һауе found 
what we are searching for. 

Experience has shown that whe 
academic—are taken into acco 
deviation of homogeneous classes is 
accepted, our class mean mark corres 
IQ of 100 will be 50 +3 х 10-55. 

* The school leaving age has been raised to 15 years since th 
of this certificate. 


n all subjects—practical and 
unt, the average standard 

10 and, if that value is 
ponding to a class mean 


e establishment 
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Having fixed the educational significance of the mark of 50 
and having chosen 10 as a suitable standard deviation, we 
must now find a formula for determining the assigned mean 
mark of any class. We shall assume that the interval 
between means of classes will be similar to the interval 
between their respective mean IQs. These intervals will not 
be equal because the measure of scatter of marks (10) is not 
equal to the measure for IQs (15) but these intervals should 
become equal if we express them in terms of their respective 
scatters or standard deviations. Since we know our key 
values—mean IQ of 100 corresponding to an assigned mean 
mark of 55— 

.А.М.М.-55 МАЛО - 100 
ы. 10 Y 15 
This gives us the scaling equation— 


A.M.M. =55 +0 (М.10 - 100) 


For example, if а class mean IQ is 112, then 
A.M:M. =55 + (112-100) 


=55 +8 
= 63 
It ought to be emphasised that this formula, notwith- 
standing the above attempt to rationalise it, is essentially 
empirical; it crystallised out of an investigation designed to 
test whether a scale of marks based on an assigned mean 
mark of 55 for mean IQ 100 and 65 for mean IQ 115 
would give marks in general agreement with current practice 
in a junior secondary school. It was found to work satis- 
factorily but we make no claim for this formula in other 
types of schools where the spread of ability is narrower. 
When the second criterion—the mean promotion . test 
score—is used, an almost identical value for the assigned mean 
mark isobtained. The following data refgr to recent entrants 
to a junior secondary school. The standard deviation for the 
County in the promotion test was 50. 


ASSIGNING CLASS MEAN MARKS ELT. 
Class Ta Ib Ic 
Mean IQ 113 | 100 91 
Calculated assigned mean mark | 64 55 49 
Mean test mark 151 107 72 
Assigned mean mark 64 55 48 


Calculation for class Ia— 
A.M.M. -55 +0 (151 - 107) 


=55 +88 
=64 
We сап also find out what percentage of the class is likely 
to fall below 50. 


Distance of 50 below class mean =63 – 50 
=13 ` 


From Appendix IV, —1:30c corresponds to the 9-7 per- 
centile, i.e., 9-7 per cent. would be below 50. 
The following gives data for typical classes:— 


Mean Assigned Percentage 

Group IQ Mean Mark below 50 
Typical A. 112 63 10 
REB 100 55 31 
ALC 90 48 58 


A third characteristic of a good system of assigning class 
means would be that the overlap between the marks given to 
typical A, B and C classes would not be too great. A system 
that allowed for too great an overlap would not attain its 
purpose and the drawing office episode might occur again. 


On the other hand, too small an overlap might make it appear 


as if A, B and C were different species instead of belonging to 


one continuous variation. The following warning from 
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Thorndike* makes this point:— It should be unnecessary 
to warn the reader against the absurdity of deliberately 
changing continuous variation into a few groups by coarse 
Scaling; next assuming that the central part of one of these 
coarse divisions really measures all the individuals therein, 
and finally imagining that because the continuous series 
varying from “а” to “a+b” has been called say, poor, 
medium, good and excellent, there are really gaps within it! 
Unfortunately even gifted thinkers are guilty of this error." 


TABLE 13 
Typical Distributions of Scaled Marks 


Group A B С | Total 
No. of classes 7 6 4 17 
Average assigned mean mark 64 55 48 57 
Class interval Frequency 
90-94 1 1 
85-89 6 6 
80-84 6 2 8 
75-79 20 3 23 
70-74. 29 7 36 
65-69 28 21 4 53 
60-64. 29 30 15 74 
55-59 41 34 13 88 
50-54 23 35 20 78 
45-49 10 29 22 61 
40-44 5 11 19 35 
35-39 1 8 1 20 
30-34 1 6 7 
25-29 3 3 
20-24 1 1 
Total 199 | 182 |113 | 494 
үс Below 50 8% | 27% | 52% | 26% 


It would be absurd, indeed, if the best “С? boy at his special 
aptitude subject were, by the incidence of a scaling procedure, 
prevented from rising as high in marks as the median boy 
in a typical *A" class. 

If we take the three classes indicated above as being re- 
presentative of their groups, in the normal case there will be 


* THORNDIKE, Educational Psychology, page 150, (Kegan Paul). 
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considerable overlap in the marks of the three classes. 
Twenty per cent. of the Bs will reach or exceed the mean of 
the As and 24 per cent. of the Cs will exceed the mean of 
the Bs. 

Table 13 shows typical distributions of scaled marks in 
A, B and C group classes. А study of this table shows the 
extent of the overlap of the marks between the three groups. 

The method of assigning class mean marks outlined above 
has proved to be successful in practice. Мо doubt, once the 
system of scaling has been widely adopted, some modi- 
fications will suggest themselves. If this formula or any 
other which is found to be more feasible were adopted in 
junior secondary schools, marks would be comparable from 
school to school. 


СНАРТЕК ХІ 


SCALING IN THE CLASSROOM 


Tue scaling procedure in the classroom involves transforming 

the teacher's raw examination marks into marks with an 

assigned mean and standard deviation. We have indicated 

in the previous chapter how to arrive at the assigned mean and 

that the assigned standard deviation for all subjects is taken 

as 10. қ 
Тһе scaling equation for this transformation would be 


X, -A.M.M. +19 (x - M) 
== 


х 
where Ха =scaled examination mark 
сх =Standard deviation of class marks 
X =class mark 

М =mean of class marks 
(X — M) is the deviation of a pupil's mark from the mean of 
the class marks; for example, where the class mean mark in 
arithmetic is 50, the pupil with a mark of 65 has a deviation 


of 15 from the mean. In the above equation mS may be 
Em 


x 
regarded as the adjusting ratio applied to the pupil's deviation 
from the mean. To find the scaled examination mark the 
adjusted deviation from the mean is added to the assigned 
mean mark. 

An adjusting Table (Appendix VIT) has been compiled to 
save the teacher's time and gives adjusted deviations for all 
cases that are likely to arise in the class scaling procedure. 

So far we have assumed that the mean and standard 
deviation have been calculated in the usual way. We feel, 
however, that such calculations may be considered too 
lengthy and that a suitable estimate of their values will have to 
be found to save teachers’ time even if some sacrifice of preci- 
sion hasto be made. Fortunately, as Garrett has pointed out, 
the virtual equality of theoretically more accurate and 
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theoretically less accurate methods is a familiar finding in 
mental measurements. * ЕР 

Та Chapter Ш we indicated that in а normal distribution 
Pis and Ps: cut off 16% of the cases from either end of the 
distribution; these percentiles are at a distance – Іс and +1с 
respectively from the mean. The range Рв: — Рас (to be 
named E) may be taken as an estimate of 2e and both 
theoretical considerations and investigations of actual class 
marks have shown that even with distributions not entirely 
normal, this range is а sufficiently good estimate of 2с for 
scaling purposes. 

There are three methods of classroom scaling, the first two 


_ based on different ways of calculating Ps, — Рав. 


METHOD 1—suitable for the case of whole classes of 25 
pupils and over and deemed to be passably normal. 
If the pupils’ papers are ranked in order of marks from the 
highest, the class rank corresponding to any percentile rank 
may be obtained in the following way:— 


к-- хм 05 
Class гап 100 xN + 


where x is the percentile and N the number of pupils. For 
example, the class rank corresponding to Pis in a class of 26 


pupils is 26 x26 +0-5 which is 4-7. Accordingly Pis lies 


about half-way between the scores of the fourth and fifth 
pupils from the bottom. То get the smoothed value—the 
value that will reflect the general trend of the marks in this 
region—we take the average of the scores of the 3rd, 4th, 5th 
and 6th pupils from the bottom. Similarly, to calculate Pss 
we take the average of the 3rd, 4th, 5th and 6th pupils from 
the top, and so with other cases. To save the teacher 
the trouble of calculation, Appendix VIII has been compiled 
giving class ranks for certain useful percentiles. 

The scaling procedure by Method 1 will be illustrated by 
transforming the 27 marks obtained in a history examination 
by a class whose assigned mean mark is 63. 


* GARRETT. Statistics in Psychology and Education, 2nd Edition, 1944, 
p 166, 3rd Edition, 1947, p 169, (Longman Green and Co.). 
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TaBLE 14 
History Examination Marks for 27 Pupils 


Order | Raw Marks ||Order| Raw Marks ||Order| Raw Marks 
1 85 11 56 21 46 
2 81 12 54 22 46 
3 78 13 53 23 40 
4 73 14 53 24 40 
5 65 15 52 25 34 
6 65 16 52 26 33 
7 63 17 51 27 31 
8 60 18 50 
9 59 19 47 

10 59 20 47 


Step l—Arrange pupils’ papers in order of merit, the 
highest scoring pupil being ranked 1. 

Step 2—Ascertain the assigned mean mark for the class 
according to the method outlined in Chapter X. In this 
example we are told that the A.M.M. is 63. 


Step 3—From Appendix VIII the class rank corresponding 
to Ра Ри is 4-8. Therefore the percentile scores lie between 
the fourth and fifth scores from the top and bottom. 

Step 4—Find the average of four marks, two on either side 
of Pa. This mark may be taken as the mark of the 84th 
percentile pupil. Count off the 3rd, 4th, 5th and 6th from 


the top, 78, 73, 65, 65. Ps:=70, the average of those four 
marks. 


Step 5—Find Pis in the same way. Рі-40, the average 
of 34, 40, 40 and 46. 


Step 6—Calculate the mean of the marks by taking the 
average of Ps, and Pig. 
Mean _ Ps + Pius _70 +40 L. 
2 2 
Step 7—Calculate the standard deviation с - зы 


55 


70-40 
в = 


2 
=15 
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Step 8—Find the adjusting ratio assigned sigma 
marks sigma 


adjusting ratio --- 
J ~ 15 


2 


3 
Step 9—Scale each raw mark in turn by finding its deviation 
from the mean, applying the adjusting ratio and adding to 
the A.M.M. 
For example, 85 15 +30 from the mean 55. Applying the 


adjusting ratio 30 ха —20. Hence the scaled mark becomes 
20 +A.M.M. which is 20 +63 =83. 


Similarly with the mark 81— 
Scaled mark =63 *3* 26 


=63 +17 
=80 
PRACTICAL AIDS 


1. Appendix VII gives the application of the adjusting 
ratio to the deviation from the mean for a wide range of 


Data SHEET I 
Мо. in ClasS............... АММ. 


From Table 25 rank for Р84/Р16- ........ 


P84 scores (1) P16 scores (1) 
(2) Q) 
(3) (3) 
(d) کے‎ (4) 
Sum Sum 
| 
Mean P84...... еннннн Mean P6... erre 


БИР 
10 


Adjusting ratio= EZ 
Р84--РІ6 | 


Mean score— 


84 THE SCALING ОЕ TEACHERS’ MARKS 


ratios and deviations. Reference to this table will save the 
teachers' time. 

2. It is advisable to keep a record of the calculations for 
each set of papers. This may be done by using Data Sheet 
I (see page 83) which should be attached to the examination 
papers. 

3. On each examination paper the following rubber stamp 
imprint is made:— 


Fic. XXI 
Rubber Stamp Imprint for Scaling Calculation 
Raw mark 
Mean 
Deviation 


Adjusted Dev. 
A.M.M. 


Scaled mark 


Once Data Sheet I has been completed it is a relatively simple 
job with the help of Appendix VII to complete the entries 
on each examination paper. 

The reader is invited to work out the scaled marks in the 
above example to give him an estimate of the time taken to 
scale the marks of a particular case. Тһе scaled marks are:— 

83, 80, 78, 75, 770, 70, 68, 66, 65, 65, 63, 62, 61, 
61, 61, 61, 60, 60, SE 57, 515 oh, 53, 53, 49, 48, 47. 


METHOD 2— suitable for half classes from 12 to 24 pupils 
whose marks are assumed to be distributed in passably normal 
fashion. Investigation has Shown that this method works 
well with “judgment” marks in domestic subjects, art and 
handcrafts. 

The calculation of Ps, and Р depends on what is known 
as the centroid property of the normal curve. It is 
known that the mean of the highest 38% of the scores in a 
normal distribution has a deviation of о from the mean and 
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thus to find Ps, we take the average of the top 38% of the 
marks. For example, in a class of 22, Ps: is obtained by 
finding the average of the top eight marks. Similarly Раб is 
given by the average of the bottom eight marks. Appendix 
IX gives the number of marks that have to be averaged from 
top and bottom to give an estimate of Ps: and Pio. 

The scaling procedure by Method 2 is illustrated by the 


following example:— 
TABLE 15 
Benchwork Marks of a Class of 15 Boys 


Order Mark Order Mark 

1 76 9 61 
2 70 10 58 
3 69 11 5 
4 68 12 54 
5 67 13 52 
6 66 14 51 
7 64 15 49 
8 61 


Step 1—Arrange examination papers in order of merit, 
the highest scoring pupil being ranked 1. 

Step 2—Ascertain the A.M.M. by the method outlined in 
Chapter X. In this case the A.M.M. is 58. 

Step 3—Estimate Ps, by averaging the required number of 
marks from the top. Тһе number of marks to be averaged 
is given in Appendix IX. қ 

From this table we note that six marks have to be averaged. 


/. Ра EU +70 +69 4-68 +67 +66) 


=69 
Step 4—Estimate Pic. 


Pis = 49 +51 +52 +54 +55 +58) 


=53 ' 
Step 5— Calculate standard deviation. 
Е Ра — Pio 


=о = 


Di 2 
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Step 6—Calculate the mean mark. 
Mean sey 


69 +53 _ 


Меап = 61 


assigned с 
marks o` 


Adjusting ratio -5 


Step 7—Find adjusting ratio— 


Step 8—Scale each raw mark by finding its deviation from 


the mean, applying the adjusting ratio and adding to the 
А.М.М. 


Mark 76 is 15 above the mean 


г. scaled mark = 58 +10 х15 


=77 


PRACTICAL AIDS 


1. Use Data Sheet II for calculations. Data Sheet II is 
shown completed for the above example. 


DATA SHEET И к 
No. іп class—15, A.M.M.—58. 
From Table 26 number of marks to be averaged for P84 and P16—6. 
P84 marks 1. 76 P16 marks 1. 49 
2. 70 2L 
3. 69 $ 5 
4. 68 4. 54 
5: 67 Sb 155 
6. 66 6. 58 
ss Ê 
8. 8. 
9; 9; 
Sum 416 Sum 319 
Ауегаве 69 Ауегаре 53 
а-Б-РМ-РІ6 16 , 
2 2 AE 
Mean mark— Berne = 61 
Adjusting ratio= 
=== 
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2. Print rubber stamp outline of Figure XXI on each 
examination paper. 

3. Table 24 gives the application of the adjusting ratio 
to the deviation from the mean. 


METHOD 3—suitable for cases where the distribution of ` 
marks is not thought to be normal, specially for half class 
subjects except those mentioned under Method 2. 

This method, which isa graphical one, is also based on the 
centroid property of the normal distribution and Method 2 
may be considered as a special case of this method. It has 
also the advantage that it makes use of all marks. 

The Data Sheet for Method 3 is as follows:— 


Data $нЕЕТ ІП 
No. in class... AMMO 
From Table 27. 
(ӘР (ӨР ТО) — (DE ТӨ) 
Scores A B (е) D E 
1 
2 
3 
4 
5 
Sums 
Means 


Sigma Deviations 


The completion of the above data sheet is best illustrated 
by an example. Take the case of 18 examination marks. 
These may be divided into 5 groups, 3, 4, 4, 4, 3, correspond- 
ing to percentages 1720022820222: Ш. If the groups are 
named A, B, C, D and E their means are at the following 
sigma distances from the mean 0. 

(3) (4) (4) (4) (3) 
А В C D E 


17% 22% 229, 22% 17% 
7-1.49 -0:590 0 +0590 +1-49т 
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Тһе graphical method employed is not the normal one but 
is known as the sigma chart. The vertical axis is marked off 
in the usual way but the horizontal axis has the zero in the 
centre and the units to the right and left are sigma units as in 
Figure XXII below. - 
Fic. XXII 
Horizontal Axis of Sigma Chart 


| | | | | | | 
— 30 -20 EST 0 F10 +20 +30 

The various steps in Method 3 are illustrated by the fol- 
lowing example :— 

Scale the 14 class marks, for which 61 is the A.M.M.:— 

33, 47, 48, 52, 52, 52, 53, 54, 55, 56, 56, 63, 67, 76. 

Step 1—Arrange pupils’ papers in order or marks, the 
lowest mark on top. 

Step 2—Ascertain the assigned mean mark as shown in 
Chapter X. In the above case A.M.M. is 61. 


Step 3—From Table 27 find the grouping of the marks ` 


and the sigma deviation of each group. For a class of 14 
this is— 


A B C D E 
3 4 4 3 0 
-Г37 -038 -038 4137 
li Data Sweet Ш 
No. in class—14. A.M.M.—61 


From Table 27. 


(3 (4) (4) (3) (0) 
Scores A B С D E 
1 33 32 54 63 
2 47 52 55 67 
3 48 52 56 76 
4 53 56 
5 


Sums 128 209 221 206 


Sigma Devs. - 1.37 —0.38 +0.38 +1.37 


Means 43 52 55 69 ? 
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Step 4—Complete Data Sheet III. 

Note that in this example there are no entries under E. 

Step 5—On a sheet of graph paper lay off a sigma scale 
along the horizontal axis 

-20, -1e,0, +10, +20 (1 sigma =2 ins.) 
' Step 6—Along the vertical axis lay off a scale of examina- 
tion marks. Repeat this scale at 0 in the sigma scale. Scale 
vertically 10 marks to the inch. (Figure XXIII). 

Step 7—Plot A.M.M. above 0 and (A.M.M. +10) above 
+1о and draw a straight line through them. This is the 
line of scaled marks. In Figure XXIII the points are P and Q 
where P is 61 above 0 and Q is 71 above +10. 

Step 8—Plot A, B, C, D and E above the sigma 
deviations. А is 43 above -1:37о, ie, (-1:37, 43), 
В (-0:38, 52), C (4038, 55), D (41:37, 69). Join these 
points by straight lines to form the line of marks. 

Step 9—Scale the raw marks by reading from the line of 
raw marks to the line of scaled marks. 

In Figure XXIII the line of arrowheads shows how the raw 
mark of 53 when scaled becomes 60. 

The following Table gives the results of scaling:— 


TABLE 16 
Raw and Scaled Marks for a Class ој 14 Pupils 


Raw Marks | Scaled Marks || Raw Marks | Scaled Marks 


33 37 54 62 
47 52 59 65 
48 53 56 65 
52 57 56 65 
52 57 63 71 
52 57 67 73 
53 60 76 80 


GRAPHICAL TECHNIQUE FOR METHODS ] and 2. 


Teachers find the rubber stamp technique quick and easy 
but those who are familiar with graphs prefer this method. 
“We shall illustrate this approach by the same example which 


was used in Method 1. 
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1. We assume that А.М.М., Ps: and Р have been cal- 
culated by the appropriate technique. In the example given : 


А.М.М.-63 
Ps4 =70 
Pis =40 


2. Lay off a sigma scale along the horizontal axis, 1 sigma — 
2inch. (Figure XXIV). 

3. Let the vertical axis represent examination marks. This 
scale should be repeated at the zero of the sigma scale. 
In Figure XXIV the vertical scale is 10 marks to the inch. 

4. Plot Рі vertically above —1e and Pa; above +1 апа 
draw a line through these two points—the line of marks. 
Pig is the point ( - 1e, 40) and Pss is ( +10, 70). 

5. Plot the A.M.M. above 0 and (A.M.M. +10) above 

+ le and draw a line through them—the line of scaled marks. 
In Figure XXIV the points are P (0, 63) and Q (+1, 73). 

6. Read from the line of raw marks to the line of scaled 
marks. In Figure XXIV we have illustrated the scaling of 
the raw mark 65 which becomes a scaled mark of 70 and also 
40 which scales to 53. 


CLASS AVERAGES 


Paragraph 699 of the Advisory Report on Secondary 
Education refers in the last sentence to what it calls “the usual 
invalid method" of totalling and averaging marks to find a 
pupil's general place in class. Scaled marks, of course, can 
be validly added; that is one of the purposes of scaling. А 
difficulty, however, arises when marks—scaled or unscaled— 
are totalled and averaged: the scatter of the marks shrinks 
considerably, the shrinkage increasing with the number of 
marks added. А way out is to apply Method 1 in order to 
restore the scatter to its original standard deviation of 10. 
Unless this is done an average of 50 is no longer “the lower 
limit of satisfactory performance" although the average was 
derived from marks in which 50 had that significance. 


MARKS 
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WE hope that many practising teachers and administrators 
will have stayed the journey thus far. Their reward for so 
doing will be a greater knowledge of the significance of 
examination marks but we would strongly assert that this 
knowledge can only be enriched if they put the scaling 
operation into practice. 

For practical reasons, examinations will continue to have 
their place. Inside the classroom examination marks are the 
normal means of communication between the teacher and his 
class and accordingly need not be scaled. They must be 


scaled when they have to be compounded, compared and in- 


terpreted in places outside the classroom, in the headmaster’s 


room, in the home, in the vocational guidance department, in 
the offices of prospective employers and in other schools. 
Furthermore, now that the great majority of pupils leave 
school without any externally accredited certificate, it is of 
first importance that the mark on the leaver’s card should be 
based on a planned system of marking: that means scaled | 
marks. We are convinced and we hope that those who have 
borne with us thus far are also convinced that the scaling 


process is essential to make many examination marks have a 


meaning. 

Perhaps readers will pardon 
educational philosophy which h 
book. Examinations form only a part of the educative 
process and not the most important part. Good taste, 
character and citizenship must all be cultivated in the schools 
and these do not submit to measurement. Again, despite 
advances in psychology, the human personality defies and in 
our opinion, will always defy exact measurement. It is in the 
interest of humanity that this should be so. 

Finally, we make no extravagant claims for our methods. 
There is still much to be investigated and we are sure that 
those who follow our suggestions will improve upon our 


93 


us if we re-state a part of our 
as earlier been written in this 
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techniques. We are at present experimenting with methods 
applicable to small numbers of pupils, with short cuts and 
approximations designed to lighten, without too great a loss 
of accuracy, the admittedly heavy labour of some methods 
and with special investigations of individual cases where the 
procedure has produced unusual results. 

This book was written at the request of many who had 
heard of our attempts at introducing scaling as a working 
system. We offer it as a contribution to the knowledge which 
experimental education has given to the teacher. Іп the last 
30 years or so, this branch of education has enabled teaching 
to be more effective, schools to be happier and the aims of 
education to be more thoroughly realised. We know only 
too well that in solving one problem we have created others: 


to solve the problem of to-day is to raise the problem of 
to-morrow. 


APPENDIX I 


EXAMPLES OF OBJECTIVE TESTS IN ENGLISH, LATIN AND 
MATHEMATICS 


These tests were applied to pupils of about 17 years of age who had completed 
a five years secondary course. 


ENGLISH 


A. Tn each of the following sentences оле word is wrongly employed. Under- 
line that word and, in the space provided at the end of the sentence, 
write the word that should have been used. 


(1) Late that night we saw à curious phenomena 
(2) Each of the girls came out in their turn...... 
(3) Neither men nor money were lacking for our plans. 
(4) He had seldom or ever scen such flowers............. 
(5) You have weakened instead of strengthened your case. 
(6) Her coat is as fine than but less expensive than mine. 
(7) The interest of the visitors centred round the old keep. 
(8) “Whom do you think you are?" she retorted.. 
(9) А good employer is one of those who 
employees 
(10) Nothing but acting and games interest hi 


В. Complete each of the following sentences by inserting in the blank space 
what you consider the most suitable preposition. 


his bank. — 


(4) Her geometry was good but she was we: . algebra. 


(5) He was an amiable man, slow . ger. ; 
(6) Political speeches are mainly the heads of children. 
(7) Although he was innocent, his name WAS слане a cloud. 
(8) . my part, I shall give you all the help I can. 

get a white elephant ............ one's hands. 

. nature à taciturn man. 


(9) T 
(10) He is 


C. Strength; condescension; cowardice; humour; weather; .frankness; 
friendship; endurance; darkness; revels; knowledge; cunning; stride; 


honesty; furnishings. 3 3 А 
From the above list of nouns choose that which will most suitably 


pair with each of the following adjectives. Write the chosen noun in 
the space provided after the adjective. 


1, Stygian ......... 2. Spartan 

3. Machiavellian 4, Halcyon 
5. Martial 6. Olympian 
7. Jacobean 8. Platonic. 
9. Bacchanalian 10. Rabelaisian 
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LATIN 


Read this passage, mentally translating as you read, then answer the 

questions which follow:— 
Multa super Lauso regitàt, multumque remittit 
qui revocent maestique ferant mandata parentis. 
at Lausum socii exanimem super arma ferebant 
flentes, ingentem atque ingenti vulnere victum. 
agnovit longe genitum praesaga mali mens. 
canitiem multo deformat pulvere et ambas 
ad caelum tendit palmas et corpore inhaeret, 
"tantane me tenuit vivendi, nate, voluptas, 
ut pro me hostili paterer succedere dextrae, 
quem genui? tuane haec genitor per vulnera servor 
morte tua vivens? heu, nunc miser mihi demum 
exitium infelix, nunc alte vulnus adactum! 


(a) State and explain the mood of "revocent" (verse 2) 


(b) State and explain the case of “mali” (verse 5). 


(c) His comrades brought Lausus home—exhausted, breathless, distraught, 
dead, wounded. Underline one of these words as your answer. 


(d) Of what does Lausus’ father accuse himself? 


(f) Scan verse 4:— 
flentes, ingentem atque ingenti vulnere victum. 


(g) What is the first action of the father, on learning what has happened? 


1. 


APPENDIX I 


MATHEMATICS 
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Attempt as many questions as possible. If you require to do any working, 
use the blank spaces provided for this purpose. 


(a) Find in square inches the area of a circle 


of radius 7 inches. (Take m= 


(b) Find, in c.c., the volume of a cone with 
base 6 sq. cm. and height 8 cm. 


. (a) Express M Cosec2A—1 as a trig. ratio 
of A. 


(b) If Sin?B = (1-х) (1--х), express x as а 
trig. ratio of B. 

(c) Give the values of :— 
i) Sin 90° 
(ii) Cos 180° 


Replace each of the following by a single trig. 
ratio:— 
(а) 2 Sin С. CosC. 
(p) “п x—tan y 
1-+-tan x tan y 


20 
(с) 1— tan; 
0 


1 чап 
(а) b?+c?—a? where а, b, с are sides of 
~- БОР triangle ABC. 


. (a) Find the two solutions for x:— 


ax?--(a?— 1) х-а=0 
(b) х24-ху | —121 Find the positive 
xy--y2— 4f values of (x+y) and x. 


. (а) Find the positive square root of 4+2 V3 


(Answer in surd form). 
(b) Find, in its HUE ГО the value of 
1 


3-Е уз 


Answers 
Атеа= 


Vol.— 


(a) 


(c) 
(d) 


1st root= 
2nd root= 


xty- 
х= 


(а) 


6) 
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TABLE 17 
Calculation of the Mean from Grouped Data 
Class 
Interval f d fd -G 
90-94 T 6 6 А=62 
85-89 5 о 
80-84 0 4 0 2fd_ -38 
75-79 1 3 3 М 1 
70-74 9 2 18 
65-69 11 1 11 =—0.54 
55 10 1 10 Жа 
55-59 = = = 
50-54 11 5) рео Mean A ес 
45-49 9 552 —27 
40-44 3 -4 E12 -62-0.54х5 
35-39 1 -5 -5 
=62-2.7 
N 71 +38 
—76 =59.3 
Ха – 38 


Step 1—Select an assumed or guessed mean, A, at the mid-point of the in- 
terval which is most likely to contain the actual mean. In 


the above example, A is at the mid-point of the class interval 
60-64, i.e., A—62. 


Step 2—In the column headed “4” indicate the number of intervals be- 


tween each interval mid-point and the assumed mean. Note 
that below the assumed mean the values are negative. 


Step 3—Multiply each frequency “f” by its corresponding “4” to give 
the products in column “Га,” 


Step 4—Find the algebraic sum of the fd products to give Dfd. 
Step 5—Divide Zfd by N, the total number of cases. 
Step 6—Multiply Ха by the size of the class interval, c. 


Step 7—Ада 20 xc algebraically to A to give the correct mean. 
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TABLE 18 
Calculation of the Standard Deviation from Grouped Data 


Class с-5 
Interval f d fd fa? A=62 
90-94 | 1 6| 6| 36 Ха -% 054 
85-89 | 0 5 ШІ Sa 
$9539 | 1| 3| 3| 9 lza 
5 Оа E 
е. 
5-і 2 
60-64 | 15 0 271—291 419 
5559 | 10 | “1| -10| 10 N 7 
50:54 | 1 | -2| -22| 44 Е 
дао кој Seen Ы o= [4/24 (2821 хе 
40-44 | 3 | -4|-12| 48 Мат 
Sep | it || ЕЕ || ж 
N 71 4-38 | 291 = [410-029] xe 
-76 =1.95х5 
54-38 =9.75 


Step 1—Complete the d and fd columns as in the calculation of the mean. 


Step 2—Multiply each fd by its corresponding d to give the products in the 
fd? column, 


Step 3—Find the sum of the fd? products, denoted by 2142, 
К жау, —38\2 afd? .. 291 
Si ысы КОМ (= (е, === 
tep 4—Find ( N ) ie. ( 7i ) and N i.e. 7i 
Step 5—The standard deviation is given by the formula— 
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TABLE 19 
Percentiles for given Standard Deviations 


S.D. Value | Percentiles |S.D. Value | Percentiles ||S.D. Value Percentiles] 
=33 05 =1.1 13.6 1.1 86.4 
-32 07 =1.0 15.9 1.2 88.5 
-3Л 10 = 9 18.4 13 90.3 
—3.0 14 = 18 21.2 1.4 91.9 
=29 19 = 17 24.2 1.5 93.3 
— 2.8 .26 — {6 27.4 1.6 94.5 
- 227) 35 = es) 30.8 1.7 95.5 
—2.6 47 — 4 34.5 1.8 96.4 
—2.5 .62 == 3. 38.2 1.9 97.1 
-2.4 82 - <р) 42.1 2.0 97.7 
=213 14 = 1 46.0 2.1 98.2 
222 14 0 50.0 22 98.6 
Sal 1.8 Ail 54.0 2.3 98.9 
—2.0 23 2 57.9 2.4 99.18 
— 1.9 2.9 al 61.8 2.5 99.38 
18 3.6 4 65.5 2.6 99,53 
ЕЗІ 4.5 25) 69.2 2.7 99.66 
— 1.6 55 6 72.6 2.8 99,74 
cu 6.7 ‚ 75.8 2.9 99.81 
— 1.4 8.1 .8 78.8 3.0 99.87 
= 1.3 9.7 19 81.6 3.1 99.90 
Le 11.5 1.0 84.1 32 99.93 

3.3 99.95 
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TABLE 20 
Standard Deviations for given Percentiles 


Percentiles S.D. Value || Percentiles |S.D. Value || Percentiles |S.D. Value 
1 - 3.090 31 - 496 70 + .524 
ГА — 2.878 32 — .468 71 + .553 
3 - 2.748 33 —.440 72 ae 583 
4 — 2.652 34 — 412 73 + .613 
5 - 2.576 35 —.385 74 + .643 
6 — 2.512 36 -.358 ЕШ2 + .675 
E — 2.457 37 —.332 76 4- .706 
8 - 2.409 38 —.305 Үй} + .739 
9 -2.366 39 -.279 78 Ч 772, 
1 -2.326 40 — 255 279 + .806 

2 — 2.054 41 —.228 80 +".842 
3 - 1,881 42 — .202 81 + .878 
4 — 1.751 43 —.176 82 + .915 

EXIT Sa 84 + .995 
7 — 1.476 46 -.100 85 4 1.036 
8 — 1.405 47 — .075 86 4-1.080 
9 — 1.341 48 —.050 87 4-1.126 
10 — 1.282 49 —.025 88 41.175 
11 — 1.227 50 — .000 89 +1.227 
12 — 1.175 51 +.025 90 41.282 
13 — 1.126 52 +.050 91 + 1.341 
14 — 1.080 53 +.075 92 + 1.405 
15 — 1.036 54 +.100 93 +1.476 
16 — .995 55 +.126 94 1.555 
17 — .954 56 +.151 95 +1.645 
18 — (915 57 +.176 96 +1.751 
19 — .878 58 +.202 97 +1.881 
20 — .842. 59 +.228 98 +2.054 
21 — .806 60 +.253 99 +2.326 
22 — 772 61 +.279 99.1 +2.366 
23 - 1739 62 +.305 99.2 +2.409 
24 — .706 63 +.332 99.3 +2.475 
25 = .675 64 +.358 99.4 +2.512 
26 - .643 65 +.385 99.5 +2.576 
27 - .613 66 +.412 99.6 +2.652 
28 - .583 67 +.440 99.7 +2.748 
29 - .553 68 +.468 99.8 +2.878 
30 - .524 69 +.496 99.9 +3.090 
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APPENDIX VI 
\ ARITHMETICAL METHOD OF SCALING 
(a) For use where the number of pupils is relatively small 
Тніѕ method which involves a direct application of the Scaling equation (see 


page 36) is best illustrated by an example. In Table 21 below, column 2 shows 
the teacher's estimate in English for each of the 20 pupils in the class: column 


4 shows the mark obtained by the pupil in the external examination on which 
the marks are to be scaled. 
TABLE 21 
Pupil Estimate | Square Mark Square 
1 40 1600 47 2209 
2 45 2025 54. 2916 
3 65 4225 66 4356 
4 50 2500 41 1681 
5 60 3600 59 3481 
6 65 4225 74 5476 
7 80 84 7056 
8 35 1225 40 1600 
9 42 1764 49 2401 
10 45 2025 54 2916 
11 48 2304 56 3136 
12 70 4900 72 5184 
13 45 2025 49 2401 
14 45 2025 56 3136 
15 60 3600 78 6084 
16 40 1600 43 1849 
17 60 3600 66 4356 
18 35 1225 32 1024 
19 40 1600 41 1681 
20 60 3600 65 4225 
20 1030 56068 1126 
(п) (EI) (E2) (61) (52) 


The steps in the calculations are as follows:— 
Step 1—Enter in column 3 the square of each estimate, 
Step 2—Enter in column 5 the square of each mark. 


Step 3—Add columns 2, 3, 4 and 5 to find the values of E1, E2, SI and S2, 
The number of pupils (n) is given at the foot of column 1 


Step 4— Calculate the average estimate, uan ЕТЕ T0308 51.5 


20 
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Step 5— Calculate the average mark, i.e., SL 56.3 


„51 


52-51х-- 
Step 6—Calculate the scaling ratio = к кен у 
N во-вахЕ! 


= „ /61168- 1126563 
56068— 1030 X 31.5 
=1.12 


As а check calculate the alternative form 


= 4/20267168= (1120) 
20 x 56068 — (1030) 
=1.12 


Step 7—The scaling equation now is 
à Ts— 56.3 +1.12 (T— 51.5) 


where T= unscaled estimate and Ts— scaled estimate as before. 


Step 8—Construct from the equation the scaling table 22 shown below. 


TABLE 22 
Scaling Table 
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Notes: This method has the advantages of being straightforward and of 
using all the available information. It has the disadvantages of involving 
heavy calculation but the labour is greatly lightened if an adding machine, or 
better still a calculating machine, is available. Tables of squares are useful 
and the preparation of the scaling table is made easier by the use of tables such 
as Crelle's which give multiples of three figure numbers, e.g., 1.12 in the above 


example. 
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(b) For use where the number of candidates is larger. 


Where the number of candidates is greater than 20, it is often speedier to 
group estimates and marks. Тһе method will again be illustrated from the 
data of the previous example, and the steps are as follows:— 


(1) Group the estimates and marks in five point intervals. Inspection of 
the estimates shows that they are mostly multiples of 5. А desirable 
set of intervals is therefore 38-42, 43-47, 48-52, and so on, with 40, 45, 
50,... as the midpoints. 
Тһе frequencies of the estimates and marks in each of these intervals 
are shown in columns 1 and 5 of Table 23 below. 


(2) Calculate the mean and standard deviation of each distribution by the 
method shown in Appendices II. and Ш. Do not evaluate each 
square root at this stage, The calculations are shown in Table 23. 


(3) Complete the scaling equation as shown in the table. 
Ts=56.3+1.14 (T—51.5), which differs only slightly 
viously obtained. 


(4) Construct the scaling table. 


Тһе equation is 
from that pre- 
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TABLE 23 
Calculation of Scaling Equation from Grouped Data 
Estimates Marks 3 
Class 1 2 3 - 5) 6 2 8 
Interval | f d fd fd2 f d fd fd? 
83-87 0 — — = 1 6 6 36 
78-82 1 5 5 25 1 5 5 25 
73-77 0 4 0 0 1 4 4 16 
68-72 1 3 3 9 1 3 3 9 
63-67 2 2 4 8 3 2 6 12 
58-62 4 1 4 4 1 1 1 1 
53-57 0 0 0 0 4 0 0 0 
48-52 2 —1 —2 2 2. —1 -2 2 
43-47 4 -2 -8 16 2 —2 -4 8 
38-42 4 —3 | -12 36 3 —3 -9 27 
33-37 2 —4 —8 32 0 —4 0 0 
28-32 0 — — -- 1 -5 -5 25 
20 --16 132 20 --25 161 
—30 -20 
—14 5 
fd _—14_ ПИ а E 
TE Bo, 5м =з С? 
fd? 132 5 fd? 161 
к =-===6. ==>-==8.0. 
B= 690 Бо 50 78:05 
Меап =55—5 х0.70 Меап=55+5 х 0.25 
=51.5 =56.3 
в=5\/6.60—0.49 в=5\/8.05—0.06 
=5\/ 6.11 =5\/7.99 
А 5199 _ 
Scaling ratio ————— —4/1.307 —1.14 
5 /6.11 У 


Scaling equation:—Ts=56.3+1.14 (Т—51.5) 
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TABLE 24 


Adjusting Table 


To be applied to deviations 


10 | 10 | 10 | 10 | 10 | 10 | 10 | 10 | 10 | 10 | 10 | 10 | 10 

6|7,8|9]|ir|12|13/14| 15 | 16 | 17 | I8 | 19 
Rp v ы ose ДИС УИ SE SERRE 
2. 3] РЕЗ 9 ed vx eer PR] ли ти ти |І 
82125174) cds 373. | 24] 22 || 2.) 251 22): 2.) 2 
Beal Wa О ЕА Риз а зоо 
Dea MERI 616051 а 5 | 8.) 31335 
ЕО ЕЕЕ ЕЕ ЕРА 41747 1 зз 
7. 121101 98| 6| 6| 5| 5| 5| 4| 4| 4| 4 
ВАЗ ТИ ГӘ 77| m$ 675 51 s I4 4 
9. | 15 |13 | 11 |10| 8| 7| 7| 6| 61 6| 5| 5| 5 
103 |17/14|13| 8n | 9| 8| 8| 7| 7| 6| 6| 6| № 
11. |18| 16 14 | 12 TO 9| 8| 8| 7| 7| 6| 6| 6 
12.) 20} 17/15] 13 | 1 |10| 9| 9| 8| 8| 7 7| 6 
13. 22 19 16 | 14 | 12 | 11 | 10] 9| 9| 8| 8| 7| 7 
14. | 23 | 20 | 18 | 16 | 13 | 12 | 11 | 10] 9| 9| 8| 8| 7 
15. | 25 | 21 | 19] 17 | 14 | 12 | 12 | 11 | 10] 9] 9| 8| 3 
16. | 27 | 23 | 20 | 18 | 15 | 13 | 12 | 11 | 11 | 10 .9|9| 8 
17. | 28 | 24 | 21 | 19 | 15 | 14 | 13 | 12 | 11 | 11 |10| 9| 9 
18. | 30 | 26 | 23 | 20 | 16 | 15 | 14 | 13 | 12 | 11 | 11 | 10] 9 
19. | 32 | 27 | 24 | 21 | 17 | 16 | 15 | 14 | 13 | 12 | 11 | 11 | 10 
20. | 33 | 29 | 25 | 22 | 18 | 17 | 15 | 14 | 13 | 13 | 12 | 11 П 
21. | 35430 | 26 | 23 | 19 | 17 | 16 | 15 | 14 | 13 | 12 | 12 11 
22. | 37 | 31 | 28 | 24 | 20 | 18 | 17 | 16 | 15 | 14 | 13 12 | 12 
23. | 38 | 33 | 29 | 26 | 21 | 19 | 18 | 16 | 15 | 14 | 14 13 | 12 
24. 34 | 30 | 27 | 22 | 20 | 18 | 17 | 16 | 15 | 14 | 13 | 13 
25. 36 | 31 | 28 | 23 | 21 | 19 | 18 | 17 | 16 | 15 | 14 | 13 
26. 37 | 33 | 29 | 24 | 22 |20 | 19 | 17 | 16 | 15 | 14 | 14 
27. 39 | 34 | 30 | 25 | 22 21 | 19 | 18 | 17 | 16 | 15 | 14 
28. 40 | 35 | 31 | 25 | 23 | 22 | 20 | 19 | 18 | 16 | 16 | 15 
29. 41 | 36 | 32 | 26 | 24 | 22 | 21 | 19 | 18 | 17 | 16 | 15 
30. 43 | 38 | 33 | 27 | 25 | 23 | 21 | 20 | 19 | 18 | 17 | 16 
31. 39 | 34 | 28 | 26 | 24 | 22 | 21 | 19 | 18 | 17 | 16 
32. 40 | 36 | 29 | 27 | 25 | 23 | 21 | 20 | 19 | 18 | 17 
33. 41 | 37 | 30 | 27 | 25 | 24 | 22 | 21 | 19 | 18 | 17 
34. 43 | 38 | 31 | 28 | 26 | 24 | 23 | 21 | 20 | 19 | 18 
35. 32 | 29 | 27 | 25 | 23 | 22 | 21 | 19 | 18 
36. 33 | 30 | 28 | 26 | 24 | 23 | 21 | 20 | 19 
37. 26 | 25 | 23 | 22 | 21 | 19 
38. 24 | 22 | 21 | 20 
39. 23.|-22: | 21 


106 


APPENDIX VIII * 


TABLE 25 


Percentiles and Corresponding Class Ranks 
Example: Ina class of 24 the pupil at P84 is ranked as 4.3, counted from the 


top, and the pupil at P16 is ranked as 4.3, counted from the bottom. 


Мо. in| P84/ | P95/ | P75/ Хо. inl P84/ P75/ 
Class | P16 | P5 P25 | P50 jÎ Class | P16 P25 
20 3.7 1.5 5.5 | 10.5 34 5:9 22 9.0 
21 3.9 1.6 58 | 11 35 6.1 23 9.3 
22 4.0 1.6 6.0 | 11.5 36 6.3 2.3 9.5 
23 4.2 17 63 | 12 37 64 | 24 9.8 
24 43 1.7 6.5 | 12.5 38 6.6 | 2.4 | 10.0 
25 4.5 1.8 6.8 | 13 39 6.7 2.5 | 10.3 
26 4.7 1.8 110 | 13:5 40 69 | 25 | 005 
27 4.8 19 73 |14 41 71 2.6 | 10.8 
28 5.0 1.9 7.5 | 14.5 42 7.2 | 2.6 | 11.0 
29 5.1 2.0 7.8 | 15 43 74 | 27 | 11.3 
30 5.3 2.0 8.0 | 15.5 44 7.5 ЛЕМА, 
31 55| ЕП 8.3 |16 45 7Л1| 218 | 118 
32 5.6 2.1 8.5 | 16.5 46 17,9) 2.8 | 12.0 
33 5.8 22 8.8 |17 47 8.0 | 29 | 123 
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The ‘numbers given below М (the number of papers) tell the number of 
шз that paye to be averaged from top and bottom to give an estimate of 
P84 and P16. 


TABLE 26 
Number of Marks for P84 and P16 


N 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 
№. of scores} 44 5 5 56667788899 
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Adjusted average (promotion stage), 60-1 
Adjusting ratio, 80, 83, 86 
Adjusting table, 106 
Advisory Council's report on Secondary Education in Scotland, 5, 46, 62 
Arithmetical method of scaling, 38, 102-4 
Arithmetical probability paper, principle of, 24 
Assigned mean mark, calculation of, 76 
Assigning class mean marks, basis for, 75-6 
characteristics of system of, 75-9 
Assumptions, fundamental to class scaling, 73 
Averages, shrinkage of scatter, 61, 91 


Borderline cases (promotion stage), 61 
Burt, Sir Cyril, 12, 29 


Card, qualifying data, 59 
Certificate, Day School (Higher), 75 
Day School (Lower), 75 
Senior Leaving Certificate, 41, 46, 48, 62-5, 67 
scaling procedure applied to, 62-3, 67-9 
School, 62 
Class average, 6, 7, 10-1 
Class interval, 13 
lower and upper limits of, 18-9 
Class ranks from percentile ranks, calculation of, 81 
table of conversion, 107 
Class marks, scaling methods, 81-92 
Comparability of marks between classes, how achieved, 73-6 
Cumulative frequency, 17-8, 42, 51 


Data sheets, 83, 86-7 

Diagrammatic representation of, frequency distribution (score scatter), 14 
distribution of test marks, 8, 9 

Director of Education, 7, 9, 34 

Distribution, normal frequency, characteristics of, 20-1 


E (Range Pg4-P16), 81, 82, 85 
Equation, for assigning class mean marks, 76 
for scaling class marks, 80 
scaling, 36 
teachers’ estimates, scaling of, 37 
Equivalent scores, illustrated by thermometer, 29-32 
Estimates, teachers', 7 
characteristic features of (L.C. stage), 70-1 
function of scaling (L.C. stage), 64-5 
scaled, help to teacher, 47 
scaling of, 35, 37-45 
' scaling, fundamental principles of, 37; (L.C. stage), 69-70 
scaling procedure (L.C. stage), 67-8 
Scaling small groups, 67 
varying school standards (L.C. stage), 63-4 
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Examinations, interval v. external, 1, 2 
researches on, 2, 3 ) 
objective type, advantages and disadvantages of, 3 


Formula, conversion of temperature scales (C., Fah.), 30 
percentile, 18 

Frequency, 13 

Frequencies cumulative, 17-8, 42, 51 

Frequency distributions, 13 


Garrett, H. E., 80, 110 

Graphical conversion of temperature scales; 31-2 
Graphical scaling of class marks, 91-2 
Graphical scaling of teachers' estimates, 39, 41-4 


Graphical scaling of teachers’ estimates, disadvantages of Method 1,41 


Histogram, 14-5 
of normal IQs, 20 


Interpercentile ranges, 17 
Interquartile range, 16-7 
IQs in promotion battery, 49 


Mark, assigned class mean mark, 74-6 
examination, meaning of, 5, 7 
pass, at L.C. stage, determination of, 65-6 
pass, at promotion stage, 58-9 

Marking scales, fixed points and units of, 35 


Marks, achieving comparability of, between subjects, 73 
combining, 8-10 


comparing, 8-10 
differences of mean and scatter of class subject marks, 10 
measures of scatter of, 16-7 
Taw, 10, 49, 59 
scatter of, 7-10, 35, 42 
standardised, 49-56 
McClelland, W., 39, 57, 58, 61, 75 
Mean, approximation to, 28, 82, 86 
as measure of standard, 16, 35 
calculation of, 16, 98 
measuring from, 21 


Normal frequency curve, area divided at ic intervals, 22 
centroid property of, 84-5, 87 
relationship between standard deviation and area of, 22 
calculation of scores corresponding to percentiles, 23-4 


Normal frequency distribution, characteristics of, 20-1 
of 105, 20 


Objective tests, examples of, 95, 96, 97 

Ogive, 24-5, 52 

Order of merit, not affected by scaling, 38, 70 
Overlap of groups of classes, 77-8 


Pass mark, at L.C. stage, determination of, 65-6 
at promotion stage, 58-9 
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Percentile, calculation of, with grouped data, 18-9 
corresponding to upper limit of class interval, 19 
corresponding sigma (standard) scores, 22-3, 100-1 
curve, 24-6 
definition of, 17 
formula, 18 
line, 56 
ranking (rating), 42, 51, 53 

Percentile scaling, contrasted with sigma scaling, 47-8 
principles of, 41-2 

Permille paper, 24-7, 44-5, 68 
advantages of, 24, 54-5 
principles of, 24, 54 

Promotion scheme, operation of, 59-61 
outline of, 58-9 


Quartiles, lower and upper, 16-7, 40 


Range(s), 16 
interpercentile, 17 
interquartile, 16-7 
Ranking pupils, 12-3 
Ratio adjusting, 80, 83, 86 
scaling (adjusting), 36 
Record Card, Scottish National, pupils, 74 


Report of Advisory Council on Secondary Education, 5, 46, 62 


Report to parents, 5-7 


Scale, standardised, advantages of, 50 
Scaled estimates, help to teacher, 47 
possibility of negative values, 38 
Scaled marks, help to outside bodies, 93 
Scaling, arithmetical (sigma) method, 38, 102-4 
choice of method, 48 
effect on order of merit, 38, 63 
equation, 36 
equation for teachers’ estimates, 37 
estimates, first graphical method, 39 
second graphical method (percentile), 41-5 
on IOs, criticism of, 46-7 
subject on subject, 37, 46, 70 
fundamental principles, 35, 37, 69-70 
percentile method, principles of, 41 
ratio, 36 
sigma and percentile, differences between, 47-8 
small classes, 47, 67 
Scatter, measures of, 10, 16-9 
Scatter of marks, 7-10, 35, 42 
effect on weight of mark, 9 
Scores equivalent, illustrated by the thermometer, 31-4 
Score scatter, 14-5 
Sigma chart, 88, 90, 92 
Sigma scaling, 38 
contrasted with percentile scaling, 47-8 
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Standard deviation, ( o), 19, 53 
approximation to by range Pes-Pig, 24-8 
calculated from grouped data, 99 
definition of, 19 . 
and weight of test, 57-8 

Standard, measure of (mean), 16 
of marking, 35, 42 
Scores, 36 

Standardisation, by Permille paper, 54-6 
definition of, 49 
scale, reasons for choice of fixed point and unit of, 49-50 
worked example of, 50-56 

Standardised marks, 49 

Summary mark (promotion stage), 57 


Tables, reference, 100, 101 
Temperature, B.U.T. (big unit of temperature) as basis of standard ог common 
scale of, 30-2 
conversion formula, 30, 35 
scales, conversion of, to illustrate scaling process, 29-34 
graphical conversion, 31-2 
Tests, Intelligence, Moray House, 57 
objective type, advantages and disadvantages of, 3 
at L.C. stage, 66 
at L.C. stage, Specimens, 95-7 
promotion, battery of, 57 
Weight of, depending on standard deviation, 8, 9, 57 
Thorndike, E. L., 78 


Transfer of pupils, basis of selection, 57 
outline of scheme, 58-9 
Weight of test, depending on standard deviation, 8, 9, 57 


Zig-zag, 24 
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